MIT/LCS/TR-223 


TOWARD A COMPUTATIONAL THEORY 


OF INDIRECT SPEECH ACTS 


Gretchen P. Brown 


This research was supported by the Advanced Research
Projects Agency of the Department of Defense and was
monitored by the Office of Naval Research under
Contract Number N00014-75-C-0661.




Massachusetts Institute of Technology 


Laboratory for Computer Science 


Cambridge, Massachusetts 02139




Abstract 


The variety of surface forms that may be used to convey a given speech act poses a
major problem in modelling task-oriented (and other) dialogues. Many such forms are
so-called indirect speech acts, that is, the surface form does not correspond to the (or one)
intended speech act. While this topic has received extensive attention from linguists, their
concerns have not usually been computationally motivated. In this paper, I present a
non-computational analysis of indirect speech act forms with an eye to computational
considerations.


The paper is divided into two parts. Part 1 presents categories and rules for indirect
speech acts, justified where possible by traditional linguistic arguments. The second part of
the paper draws a set of computational implications from the material presented in Part 1.
This is done within the general framework of a process model of recognition. Part 2 
contains a discussion of the basic types of mechanisms needed for the classes of indirect 
speech act identified in Part 1. The discussion includes an examination of the dependencies 
between processes and an initial categorization of the types of knowledge that must be 
considered in interpreting indirect speech acts. 


Keywords: indirect speech acts, speech acts, English dialogue, conversational maxims,
semantics, natural language, computational linguistics, pragmatics, language recognition






Contents

1. Introduction
    1.1 The Area of Interest
    1.2 Excluded Topics
    1.3 The Organization of the Paper

PART 1: THE TAXONOMY OF INDIRECT SPEECH ACTS PROPOSED

2. Defining and Organizing Speech Acts
    2.1 Some Major Points from Speech Act Theory
    2.2 Half of an Approach to Taxonomy
    2.3 The Other Half
    2.4 The Notion of a Precondition
3. The Speech Acts Used in the Examples
    3.1 Information-Centered Speech Acts
    3.2 Action-Centered Speech Acts
4. Representing Procedures: The OWL-I Method
5. Three Approaches to the Treatment of Indirect Speech Acts
    5.1 Gordon and Lakoff
    5.2 Sadock
    5.3 Searle
6. Some General Rules for Indirect Speech Acts
    6.1 ISA Forms Related to the Maxim of Necessity
    6.2 ISA Forms Related to the Maxim of Desirability
    6.3 ISA Forms Related to the Maxim of Possibility
        6.3.1 Forms Based on Preconditions of Individual Speech Acts
        6.3.2 ISA Forms Based on Semantic Cases of Actions
    6.4 Tense and Mood Variations
    6.5 Examples that Fall Outside the Rules
7. A Taxonomy of Indirect Speech Act Forms
    7.1 Overview
    7.2 Direct, Indirect, and Tag Forms
    7.3 The Speech Acts Conveyed
    7.4 Frozen ISA Forms
    7.5 Explicit and Implicit Actions
        7.5.1 Must you smoke?
        7.5.2 Can you reach the salt?
        7.5.3 [title illegible]
        7.5.4 Special Properties of Type 3 Implicit-Action Forms
    7.6 Summary: Implications for the General ISA Rules

PART 2: COMPUTATIONAL IMPLICATIONS

8. Computational Perspectives on ISAs
    8.1 Drawing Computational Implications from the ISA Categorization
    8.2 A Look at Related Work in Dialogue Modelling
9. A Framework for Recognition Processing
    9.1 A System Configuration for Recognition
    9.2 The Event Tree and Recognition Mode
10. General Observations about ISA Recognition
    10.1 The Role of the Recognition Framework
    10.2 ISAs and the Event Tree
    10.3 Some Remarks about ISA Ambiguity
    10.4 Dependencies in ISA Disambiguation
11. Outline of a Mechanism for ISA Recognition
    11.1 Applying the Rules for Explicit-Action Forms
    11.2 Recognizing Frozen ISA Forms
    11.3 Recognizing Implicit-Action ISAs
        11.3.1 Integrating Implicit Actions into the ISA Mechanism
        11.3.2 Recognizing Utterances with Tertiary Forces
12. Knowledge Sources for ISA Recognition
    12.1 Domains of Discourse
    12.2 Complementary Roles of P1 and P2
    12.3 Properties of P1
    12.4 Properties of P2
    12.5 Exception Conditions
    12.6 The Course of the Interaction to Date
13. Conclusion

References
Appendix A. Core Dialogue Methods
Appendix B. Matching Semantic Representations against Rules
Appendix C. ISA Examples for Rules 6.1 to 6.9
Appendix D. Relating Rule 7.6 to OWL-I Methods
Appendix E. A Summary of Representation Levels in the General Recognition Model




1. Introduction 


1.1. The Area of Interest


The variety of surface forms that may be used to convey a given speech act poses a
major problem in modelling dialogue.¹ Consider, for example, some of the different ways to
ask someone else to write a computer program for you:


1.1 I request that you write a program to manipulate blocks for me.
1.2 Write a program to manipulate blocks for me.
1.3 Would/Will you write a program to manipulate blocks for me?
1.4 Could/Can you write a program to manipulate blocks for me?
1.5 I want you to write a program to manipulate blocks for me.
1.6 I want a program to manipulate blocks written.
1.7 I want a program to manipulate blocks.
1.8 I would like you to write a program to manipulate blocks for me.
1.9 I would like a program to manipulate blocks written.
1.10 I would like a program to manipulate blocks.
1.11 Give me a program to manipulate blocks.
1.12 Would/Will you give me a program to manipulate blocks?
1.13 Could/Can you give me a program to manipulate blocks?
1.14 I want you to give me a program to manipulate blocks.
1.15 I would like you to give me a program to manipulate blocks.
1.16 Write me a program, would/will you?
1.17 Write me a program, could/can you?


1. I am grateful to William A. Martin and members of the Knowledge Based Systems Group
at the M.I.T. Laboratory for Computer Science. Their work provided the foundation on
which this paper is based. The OWL-I knowledge representation formalism used for the
process model discussed in Part 2 and the approach to interpreting procedural
representations are products of the efforts of this group. I also wish to thank Bill Mark,
Candy Sidner, Peter Szolovits, and Jim Weiner for helpful comments on drafts of this paper
and Ashok Malhotra, William Mann, and Louis Melli for making [illegible]
available to me.


Utterances 1.3 to 1.17 are commonly called indirect speech acts, since the surface form does
not correspond directly to the intended speech act. This is not a complete list but is intended
only to illustrate the variety of forms that are possible. Similar lists could be made for other
speech acts. Requests seem to have the largest number of possible forms, but [illegible],
suggestions, and questions also appear in a number of variations. In fact, all speech acts
with the exception of some ceremonial acts seem to have at least some indirect forms.
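
As an illustration only, the many-to-one relation between surface forms and an intended speech act can be written down as a small data sketch. The Python fragment below is not part of the analysis in this paper: the names `Utterance`, `EXAMPLES`, and `forms_for` are hypothetical, only a few of the listed forms are encoded, and the direct/indirect labels simply restate the observation above that 1.3 to 1.17 are indirect forms while 1.1 and 1.2 are not.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Utterance:
    """One surface form together with the speech act it is intended to convey."""
    label: str          # example number used in the text
    text: str           # the surface form
    intended_act: str   # the speech act the speaker intends
    indirect: bool      # True if the surface form does not directly match that act


# A handful of the request forms listed above (hypothetical encoding).
# The "Would/Will" alternation of the original examples is collapsed to one variant.
EXAMPLES = [
    Utterance("1.1", "I request that you write a program to manipulate blocks for me.", "request", False),
    Utterance("1.2", "Write a program to manipulate blocks for me.", "request", False),
    Utterance("1.3", "Would you write a program to manipulate blocks for me?", "request", True),
    Utterance("1.5", "I want you to write a program to manipulate blocks for me.", "request", True),
    Utterance("1.16", "Write me a program, would you?", "request", True),
]


def forms_for(act, examples):
    """Return (direct, indirect) surface forms that convey the given speech act."""
    direct = [u.text for u in examples if u.intended_act == act and not u.indirect]
    indirect = [u.text for u in examples if u.intended_act == act and u.indirect]
    return direct, indirect


direct_forms, indirect_forms = forms_for("request", EXAMPLES)
```

A table of this kind only records which forms go with which act; it says nothing about how the indirect forms are derived or recognized, which is the question a theory of ISAs has to answer.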
For a theory to adequately deal with indirect speech acts (henceforth ISAs)² it must
contain at least the following:

    1. A characterization of the forms that can be used to convey
       individual speech acts

    2. A description of mechanisms to relate surface forms to
       underlying speech acts³

    3. An account of the conversational implications of choosing
       a particular surface form


Parts 1 and 2 of this paper relate, respectively, to the first two of these goals. In Part 1 I
present a group of distinctions intended as the core of a taxonomy of ISA forms (Goal 1).
The categories presented draw on insights from, but not necessarily the theoretical
framework of, previous work in this area (e.g., Sadock, Searle, and Gordon and Lakoff).
The categories differ in important ways, however, from existing proposals. Although the
choice of these categories was strongly motivated by computational considerations, they are
supported by independent linguistic evidence as well. This use of non-computational
evidence to justify computationally attractive categories is a methodological strength of the
presentation.


2. I hope that the similarity between this abbreviation of indirect speech act and IS-A, a
name used commonly in the Artificial Intelligence literature for a hierarchic semantic
relationship, is not too confusing. The [illegible] as
an acronym.


3. Here and elsewhere in the paper, I use surface form to mean either a written or spoken
English utterance.


Part 2 of this paper focusses on the second area cited above. There I concentrate on
recognition and describe aspects of an ISA process model. ISA recognition cannot be
completely divorced from general issues of recognition, and I have set the process model
within a larger framework. ISA processing can, however, be viewed independent of a
particular implementation, and that has been done here.

The third area cited above, accounting for the conversational implications of choices,
will not be given a systematic treatment here due to the scope of the problem. Some work
in this area has already been done, e.g., that by Lakoff [15] and Davison [7].

My account of ISAs will be restricted to task-oriented dialogue, which was defined by
Grosz as “one in which two (or more) people communicate for the sole purpose of
completing some task” [8].⁴ I would extend this definition beyond people to include
communicating systems (i.e., computers). A good model of task-oriented dialogue would
have obvious practical advantages in the realm of human-machine interaction. Beyond
this, task-oriented dialogue has some methodological advantages. First, the task being
carried out not only gives structure to the dialogue but also gives valuable independent
information to the outsider trying to interpret dialogue transcripts. Information about the
task can be particularly useful in ascribing goals and motivations to dialogue participants.
Second, the conversational implications of choosing a particular ISA form (Goal 3, above)
are generally less crucial to task-oriented dialogue than, say, to the therapeutic dialogue
analyzed by Labov and Fanshel [14]. This does not mean that the implications of choices
can be ignored in an ISA model, but it does mean that work on modelling task-oriented
dialogue can proceed without waiting for a full systematic treatment of the implications of
choices.


4. Although this definition opens up the possibility of more than two participants, I will
restrict my attention to two-party conversations.


In developing this treatment of ISAs, I have concentrated on the areas of similarity
between typewritten and spoken utterances, which means that I will have little to say about
features unique to speech, e.g., intonation. Certainly more work on the place of special
speech mechanisms is warranted. The commonalities between written and oral modes are a
good place to start, however, because there is a good deal of lexical and grammatical
variation to be accounted for in ISAs.

Two additional remarks: first, examples are drawn from several different sources. A
number of examples are drawn from the ISA literature, and some have been taken from
transcripts of both typewritten and spoken dialogues. The rest of the examples are
manufactured ones, to avoid the necessity of long scene-setting explanations and to more
explicitly highlight the phenomena that are of interest. Although I have been able to find
counterparts for some of the manufactured examples in dialogue transcripts, this has not
been the case for all of them. In general, I leave the empirical justification of the proposed
ISA framework to further research and appeal instead to the linguistic intuitions of the
reader.


The second remark relates to representation. A major focus of this paper is on rules
and pieces of semantic representation that together, I claim, can be shown to account for a
large variety of ISA forms. The rules and semantic representations are presented here in
informal English, although they have been translated into a knowledge representation
scheme called OWL-I. I feel that semantic representations in general are still in their
infancy; we still have only an imperfect idea about what elements should be included in a
semantic representation and why. Because of this, little would be gained by the use of the
notation here. The loss of precision in using English is hopefully offset by the greater
intelligibility of the presentation and the obvious applicability of the ISA rules across
representation schemes.

I am, then, considering ISAs in task-oriented dialogue, and I will be working
primarily with the commonalities that exist between written and spoken forms. Before a
more detailed survey of what the paper will include, I will list some of the areas that will
not be considered.


1.2. Excluded Topics

There are two sets of topics that will be excluded from consideration. The first relates
to phenomena that I do not consider to be ISAs. The second set of topics relates to
legitimate ISAs that will be ignored in an effort to limit the scope of the paper.

We start with the non-ISAs. ISAs will be categorized by their intended underlying
speech acts, so this excludes speech acts that are not intended by the speaker/sender, P1.⁵
For example, consider the case where P1 intends a simple description of a personal wish but,
on hearing P1's utterance, P2 treats it as a request and sets about to make the wish a reality.
Whatever the result of P1's utterance, if P1 did not intend the utterance as a request, then I
will not consider it one.

Another sort of utterance that will not be considered to be an ISA is one where there
is an additional implicit sense arising only from a particular cooccurrence of dialogue steps.
For example, in a computer console session environment, users sometimes type “Thank you”
in a place where other users type “Thank you” followed by “Good-bye”. I do not consider
“Thank you”, when it occurs alone in such a situation, to be an indirect closing. Instead, I
consider the closing to be an optional step, which may be omitted in the presence of
utterances that uniquely identify the place in the dialogue. A similar optional step would be
a statement acknowledgment such as “I understand,” “Uh huh,” or “O.K.” If the next
utterance is one that shows that the statement has been understood, an acknowledgment is
not strictly necessary. The criterion I am using to distinguish ISAs from utterances
preceded or followed by omitted optional steps is that ISA forms are derivable from
conditions associated with the conveyed speech act(s). Utterances that “imply” omitted steps do
so based on relationships at a level of dialogue structure more aggregate than speech acts.⁶
Consequently, just because one utterance U1 implies another, U2, this does not mean that U1
should automatically be classed as an ISA with the force conveyed by U2 as an underlying
speech act.


5. To simplify discussion of dialogue exchanges, I will call the speaker/sender of the first
utterance considered P1 and the hearer/receiver P2. If subsequent related utterances are
discussed, then P1 and P2 continue to refer to the same participants. The only place where I
depart from this convention is in reporting on other theories. In these cases, I follow the
terminology used by the author of the theory.


6. See Section 4 for a discussion of aspects of this level of dialogue structure. I suspect that
application of the framework described there could lead to good progress in characterizing
the conditions that permit omitted steps.


This takes care of forms that will not be considered ISAs, and now I briefly list
phenomena that, although interesting, are beyond the scope of this paper. The first of the
excluded phenomena are cases in which rules or conventions are violated. Examples are
deliberate violations such as insincerity and accidental ones such as mistakes. In addition, I
will not discuss what I consider to be second order uses of speech acts, such as sarcasm,
jokes, or failure to make standard choices (e.g., P1 makes an utterance and has not decided
whether it is a question or a request for a nonverbal action).⁷ Finally, I have excluded one
particular class of ISAs, those identified by Fraser as hedged performatives [9]. This class
of ISAs, although extremely interesting, seems to be of relatively minor importance for the
type of dialogue I am considering.

A final area that is considered only partially is context. Linguistic and situational
context plays a major role in both the generation and the understanding of ISA forms. The
role of context clearly cannot be ignored. At the same time, a full account of the nature of
context is a massive undertaking, and one that has implications far beyond ISA forms.
This paper, then, contains less than the full story on context, although I have given it a
relatively large amount of attention. See in particular Section 12.


7. One interesting second order use of ISAs is in what can be called force shift. In these
cases, one speech act form is used to “masquerade” as another. For example, one may use a
suggestion form such as “How about picking up the blocks now?” in an environment where
authority and role relationships make it clear that the utterance is functioning as a
command. In general, force shift seems to be used to give P1 the appearance of greater
benevolence or to save face for P2.


1.3. The Organization of the Paper 
I will finish this introduction with a brief description of the organization of the
paper. Part 1 begins with a description of the major points of speech act theory that have
affected the model presented here. In Section 2 I am particularly concerned with issues of
taxonomy, since an account of ISAs depends on clear boundaries between classes of speech
acts. Section 3 gives a detailed view of the speech acts used as examples throughout the
paper, and Section 4 introduces the OWL-I method, a structure for representing actions.
The preliminaries aside, Sections 5 through 7 focus on ISAs. Section 5 discusses three
different theories of ISAs, and the next two sections present a theory that builds on all three
of these, while also accounting for additional phenomena. In particular, Section 6 presents a
set of general ISA rules. Section 7 shows how these rules must be restricted and augmented
to account for the ISA data. This section identifies a set of categories that account for some
of the major differences found among ISA forms.

The computational implications of the ISA taxonomy are developed in Part 2. The
central thesis of Part 2 is as follows:

    The phenomenon of indirect speech acts is too large to admit to a single,
    uniform computational treatment. It is necessary, then, to identify classes of indirect
    speech acts that share similar computational properties and use different
    representations and processing strategies for the different classes.

By no coincidence at all, the candidates for differing computational treatment are exactly the
categories developed in Part 1.


While recognition of ISAs has its own unique aspects, it cannot be viewed entirely
separately from the whole problem of recognizing utterances. For this reason, after some
additional introductory remarks in Section 8, Section 9 discusses the points in the
recognition process that have the most direct relevance to the processing of ISAs. Section 10
presents general observations on ISA processing in the light of the recognition framework
that was described, and Section 11 contains specific proposals for a mechanism. Finally,
Section 12 is an initial characterization of the knowledge sources that come into play in
framing and interpreting ISAs.




PART 1 


THE TAXONOMY OF INDIRECT SPEECH ACTS PROPOSED 




2. Defining and Organizing Speech Acts 


2.1. Some Major Points from Speech Act Theory 

Any treatment of ISAs necessarily rests on a general theory of speech acts.
Fortunately for the theory, but unfortunately for would-be summarizers, the speech act
literature is extensive. Since I cannot hope to adequately summarize it here, I will mention
only those aspects of the theory that are reflected directly in the account of ISAs to be given.
Readers unfamiliar with speech act theory are referred to Austin [2], Searle [29], and, for a
wide-ranging bibliography, Verschueren [34]. More specialized references will be cited in
the course of this section.

The central thesis of speech act theory is that a strict dichotomy between speech and 
action is untenable. Before Austin, much of the philosophical interest in language centered 
around its descriptive uses and the conditions under which statements could be judged right 
or wrong. While acknowledging this constative use of language, Austin identified a class of 
utterances which he called performatives. Standard examples of explicit performative 
utterances are: 


2.1 I name this ship the Queen Elizabeth.
2.2 I bet you sixpence it will rain tomorrow.


Once 2.1 is uttered under the proper circumstances, the ship officially has a name, and, once
2.2 is uttered, a bet has been proposed. The important point about these and other
performatives is that saying something is also doing something.


Austin's central insight that language is action was taken up and developed further in
the work of Searle. In [29] Searle identifies five types of constitutive rules for speech acts.
Constitutive rules create or define new forms of behavior, as opposed to regulating or
sanctioning behavior. Searle's five types of rules, when taken together, are intended to give
necessary and sufficient conditions for performance of a speech act. The five types of rules
are:


1. Rules shared by all speech acts are those based on normal input and output
conditions, which Searle describes as “the range of conditions under which any kind
of serious and literal linguistic communication is possible.”⁸


2. Propositional content rules describe constraints on the propositional content of
speech acts. The propositional content of a speech act is, very informally, what the
speech act is “about.” For example, the propositional content of a request is “a future
act A of H.” (H is Searle's abbreviation for hearer. S, which appears below, stands for
speaker.)


3. Sincerity rules are those based on conditions that must occur for the speech act to
be performed sincerely. The sincerity rule for request is that the request is uttered
only if “S wants H to do A.”


4. Preparatory rules are those based on conditions that must obtain initially for the
speech act to be performed successfully. An example for request is that the request is
uttered only if “H is able to do A.”


5. Essential rules are based on what Searle sees as the essential feature of the speech
act, one that distinguishes it from other similar speech acts. For a request, the essential
rule is that the request “counts as an attempt to get H to do A.”
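
To make the shape of these rules concrete, the five rule types can be sketched as a record with one field per type. This is a minimal illustration in Python, assuming that each rule can be held as an informal English string; the names `ConstitutiveRules`, `NORMAL_INPUT_OUTPUT`, and `REQUEST` are hypothetical, and the field values are only the examples for a request quoted above (the text gives a single preparatory example, so a single string is used).

```python
from dataclasses import dataclass

# Rule type 1 is shared by all speech acts, so it is stored once rather than per act.
NORMAL_INPUT_OUTPUT = "normal input and output conditions obtain"


@dataclass(frozen=True)
class ConstitutiveRules:
    """Rule types 2 to 5 for a single speech act, stated informally in English."""
    act: str
    propositional_content: str
    sincerity: str
    preparatory: str
    essential: str

    def conditions(self):
        """All five conditions, in the order of the list above."""
        return [
            NORMAL_INPUT_OUTPUT,
            self.propositional_content,
            self.sincerity,
            self.preparatory,
            self.essential,
        ]


# Values are the examples quoted in the text for a request
# (H = hearer, S = speaker, A = the requested act).
REQUEST = ConstitutiveRules(
    act="request",
    propositional_content="a future act A of H",
    sincerity="S wants H to do A",
    preparatory="H is able to do A",
    essential="counts as an attempt to get H to do A",
)
```

Keeping the conditions as separate named fields, rather than as one undifferentiated list, matters for what follows, since an account of indirect forms needs to refer to individual conditions of the conveyed act.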


Throughout the work on speech acts, there is a basic tension between
speech acts and the words used to carry out or describe them, commonly called speech act
verbs (henceforth SAVs). Verbs such as ask, request, congratulate, promise, apologize, etc.
can be used to carry out the action conveyed, while other verbs such as convince, persuade,
and intimidate are used descriptively.⁹ Austin offers a preliminary taxonomy not of speech
acts, but of SAVs. Searle, who presents a taxonomy of speech acts (see [31]), points out that
different SAVs do not necessarily convey different speech acts; some SAVs, for example,
differ only in the manner of carrying out the speech act. Searle's example, which is
somewhat controversial, is announce, since one can announce orders, promises, and reports.
Another example might be the use of remark and emphasize to reflect differences in delivery
of, potentially, the same information.


8. [29], p. 57.


What are the implications of these ideas for a computational theory? First, the view
that speech is action means that, as a minimum, speech acts should be represented in a
way that reflects the commonalities with other sorts of actions. At the same time, the
representation must be able to reflect the special properties of speech (and written
communication) that distinguish speech acts from other sorts of actions. Finally, we want to
maintain a distinction between speech acts and SAVs as a source of information about the
boundaries of speech acts. All of these themes will play an important role in the treatment
of ISA forms presented here.


9. This latter group of verbs focusses not on the speech act (the illocutionary act), but on the
effects of the speech act on the hearer. Austin calls the acts conveyed by such verbs
perlocutionary acts. It seems to me to be desirable, however, to emphasize the commonality
of the speech act between, say, argue and convince. These refer to the same type of act, but
for the latter, the effect is described as well. See [6] for a similar treatment of perlocutionary
acts within a computational framework.


2.2. Half of an Approach to Taxonomy

Austin estimated the number of SAVs to be between one and ten thousand.
Assuming that the number of speech acts is also of this order, then clearly a taxonomy is
essential to a general theory of speech acts. It turns out that a taxonomy, or at least a
principled approach to one, is also crucial to a treatment of ISA forms. If one wishes to
claim that forms associated with one speech act are used to convey another, then one must
be quite sure that the two speech acts are really distinct. Otherwise, the “indirection” is no
indirection at all.

In this subsection I present an approach to taxonomy that combines aspects of earlier
work with some new additions. Before plunging in, I wish to emphasize that this is not
seen as the only speech act organization possible, nor is it seen as the only computationally
useful one. It is, however, one that plays a significant role in an ISA organization. Section 3
presents additional speech act categories that will be used in the ISA analysis.

For the basic approach to taxonomy, I take a high level set of categories from Searle
but use a defining criterion derived from work by Verschueren. At the most general level
of speech act categories, I start with the five classes identified by Searle [31]: representatives,
directives, commissives, expressives, and declarations. Searle bases these five classes on the
“illocutionary point” of the act, that is, on “differences in the point (or purpose) of the (type
of) act.” Because of the importance of these speech act classes, I will give Searle's definition
of each, excerpted from [31]:



Representatives (e.g., asserting): The point or purpose of the members of the
representative class is to commit the speaker (in varying degrees) to something's being
the case, to the truth of the expressed proposition...


Directives (e.g., requesting): The illocutionary point of these consists in the fact that
they are attempts (of varying degrees, and hence, more precisely, they are determinates
of the determinable which includes attempting) by the speaker to get the hearer to do
something.


Commissives (e.g., promising): Commissives are those illocutionary acts whose point is
to commit the speaker (again in varying degrees) to some future course of action.


Expressives (e.g., thanking): The illocutionary point of this class is to express the
psychological state specified in the sincerity condition about a state of affairs specified
in the propositional content.


Declarations (e.g., nominating a candidate): It is the defining characteristic of this class
that the successful performance of one of its members brings about the
correspondence between the propositional content and reality...

These five classes give an excellent start toward a computationally useful taxonomy of ISAs.
The notion of illocutionary point, however, has been criticized by Verschueren [35] for its
combination of illocutionary (i.e., act-related) and perlocutionary (i.e., effect-related)
properties. (The definition of directives mentions the hearer explicitly, but other categories
do not.) Verschueren views SAVs as reflecting the speaker's intention to bring about some
effect in the hearer by pronouncing an utterance.

Verschueren's focus on the hearer can be profitably applied to speech acts as well as
SAVs; the one addition that I would make is that some speech acts can have more than one
intended effect on the hearer, so that the classification criterion must be according to the
principal effect that the speaker (P1) intends to have on the hearer (P2). Using this
approach, Searle's five categories can be redefined as follows:¹⁰


Representatives have as their principal intended effect that P2 come to know that P1
believes something.¹¹

Directives have as their principal intended effect that P2 take responsibility¹² for
carrying out some action.

Commissives have as their principal intended effect that P2 accept P1's commitment
to take responsibility for carrying out some action.

Expressives have as their principal intended effect that P2 come to know that P1 feels
something.

Declarations have as their principal intended effect that P2, and possibly society as
well, come to perceive and accept a change in reality.
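
The five redefinitions can be held in a small lookup structure. The Python sketch below is only an illustration: the names `SpeechActClass`, `principal_intended_effect`, and `classes_mentioning` are hypothetical, and the effect strings are paraphrases of the definitions, not a representation taken from the paper.

```python
from enum import Enum


class SpeechActClass(Enum):
    """Searle's five classes, keyed to the principal effect P1 intends on P2."""

    REPRESENTATIVE = "P2 comes to know that P1 believes something"
    DIRECTIVE = "P2 takes responsibility for carrying out some action"
    COMMISSIVE = "P2 accepts P1's commitment to take responsibility for some action"
    EXPRESSIVE = "P2 comes to know that P1 feels something"
    DECLARATION = "P2 (and possibly society) perceives and accepts a change in reality"


def principal_intended_effect(cls: SpeechActClass) -> str:
    """Return the paraphrased principal intended effect for a class."""
    return cls.value


def classes_mentioning(keyword: str) -> list:
    """List the names of classes whose principal effect mentions the keyword."""
    return [c.name for c in SpeechActClass if keyword in c.value]
```

Grouping by a shared phrase (for example, "responsibility") shows how the criterion relates directives and commissives while keeping them distinct.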

We have, then, Searle's speech act categories and a defining criterion based on a
suggestion by Verschueren. The criterion is suited to a computational approach because it
points in the direction of larger patterns of action. It takes into account both participants in
a dialogue and foreshadows responses to the speech act (i.e., whether the speaker's intention
is satisfied, is not satisfied, or whether the response falls somewhere in between). I will come
back to these issues in Section 4.

10. The first three criteria are closely patterned after similar ones given by Verschueren in
[35].

11. I use "come to know" rather than "know" here to indicate that I am defining the principal
intended effect as an action rather than simply as a state. Expressives, commissives, and
declarations are given the same treatment.

12. I use "take responsibility" here to indicate that P2 may either carry out the action or
subcontract some or all of it.

This takes care of the top of the speech act taxonomy. Besides its use in forming the
most general categories, the "principal intended effect" criterion can also be used to form
intermediate levels of speech act classes. Although I have found such categories useful for
other purposes, they are not necessary for the ISA analysis. I merely mention their
possibility and go on in the next subsection to the other half of the taxonomy, the question
of how to distinguish speech acts from each other.


2.3. The Other Half 

This brings us to the level of speech acts.¹³ Despite all the work that has been done
in speech act theory, I believe that we still lack a totally satisfying answer to the question of
how one speech act is to be distinguished from another. Searle's constitutive rules give a
way to distinguish speech acts from other sorts of actions and give a framework for
expressing distinctions between speech acts, but they do not specify exactly where lines are to
be drawn between different speech acts. The "principal intended effect" criterion gives a
necessary condition for differentiating between speech acts, but it does not seem to be
sufficient for the purposes of studying ISAs. For example, warn and command are generally
considered to be different speech acts, but both can be viewed as attempts by P1 to get P2 to
take responsibility for carrying out some action. Granted, warnings as a class have the
extra restriction that the action is to avoid some condition that is perceived by P1 to be
against P2's best interests. There is nothing to stop one, however, from commanding the
same sort of action. Thus, an utterance such as 2.3 might be construed as a warning or a
command, depending on whether authority or "good sense" is being appealed to.

2.3 Keep off the ice.

At such borderline cases, the difference seems to rest on the nature of P2's obligation to take
responsibility for the action. Warnings seem to be empowered by an appeal to P2 to act
reasonably, i.e., in P2's own interest. Commands are empowered by the authority
relationship that exists between P1 and P2. If command and warn must be distinguished
based on the nature of the relationship appealed to, then the "principal intended effect"
criterion is clearly not sufficient. I need, then, a principled approach for the speech act level
of the taxonomy.

13. I will refer to generic speech acts as, simply, speech acts. Realizations of a generic
speech act will be referred to as particular speech acts.

I start the search for a criterion with the conviction that the resulting set of speech
acts should look familiar. They should not be radically different from the categories found
in the literature and, more basically, those reflected in English and other languages. The
resulting system should allow the use of familiar terms such as warn, command, request, ask,
tell, etc.

After classifying a number of speech acts, I have found no one simple criterion that
yields the familiar set of speech acts. There are, however, three guidelines for what
constitutes a speech act that I have found useful:


1. SAVs suggest a minimal group of speech acts, if one omits:

a. general SAVs such as say, which can refer to more than one speech act (where
"speech act" is delineated by application of some other guideline).

b. context-marked SAVs such as reply, [illegible], and answer, which mark the position
of the utterance in the discourse.¹⁴

c. aggregate SAVs such as describe, which refer to more than one speech act acting
as a unit.

d. SAVs that differ only according to P1's view of the importance of the speech act,
e.g., mention used for stating.

2. Sets of linguistic realizations that are substitutable for each other without seriously
disturbing the flow of a dialogue should have a separate speech act with which they
can be associated.

3. Since particular speech acts occur as part of larger linguistic and non-linguistic
activities, speech acts should be chosen so that they are easily integrated into these
larger patterns of action.

No matter how refined these guidelines become, they probably cannot be made to
function independently. That is, no one of them can be turned into a criterion that is
sufficient to clearly discriminate between all the speech acts that one would expect. With
respect to SAVs, at least in English there are candidates for the status of speech act that
have no corresponding SAV. For example, there is a separate set of surface forms for
giving instructions, and so, according to Guideline 2, one would want to identify a separate
speech act. There seems to be no English SAV that expresses the notion of giving
one-utterance-worth of instructions. (Instruct is either a request or an aggregate.) SAVs, then,
provide only a minimal guideline.

14. In barring the referents of context-marked SAVs from independent speech act status I
differ with Searle [31]. The difference is not particularly serious and means only that the
class of independent speech acts that I am admitting is slightly smaller than his.

The difficulty with Guideline 2 comes in narrowing down the notion of "seriously
disturbing the flow of dialogue," since probably no utterance forms are ever completely
synonymous. The choice of one form over another seems always to involve some
implications, e.g., politeness, formality, combativeness, etc. There seems to be a grey area
between characteristics of utterances that should be accounted for by "implications of choice"
and those that should be accounted for by assuming a difference in conveyed speech act.

With respect to the third guideline, "ease of integration" is almost certainly not a
simple property. The process of representing larger patterns of action is open to influences
beyond those local to the speech act. Frequently, altogether different sets of speech acts can
be supported if different choices are made at other places in the representation. (For
further discussion of larger patterns of action, see Section 4.)

Even though these guidelines cannot function independently and in some cases they
will still leave some grey areas, they can still have important implications for a
computational theory. A set of speech acts that follows the guidelines will have the
following properties:


1. The speech acts will be easily related to English SAVs. 


2. The speech act representations will provide a useful place to associate information
about groups of linguistic realizations.


3. The speech acts will fit into larger patterns of action, allowing us to model not only 
isolated speech acts but also speech acts as they occur in dialogue. 


The speech acts used in rules or examples in this paper are introduced in Section 3. They
represent an application of the three guidelines given here. After working with them for some
time, I believe that they display the three advantages listed above. These advantages begin
to be apparent in the discussion of the computational model in Part 2.


2.4. The Notion of a Precondition 
In defining speech acts with ISAs in mind, it is useful to shift the emphasis a bit in
the treatment of constitutive rules. I will therefore introduce the notion of a precondition of
a speech act. Preconditions are a combination of Searle's sincerity conditions and
preparatory conditions viewed from the point of view of P1¹⁵ (where P1 is the agent of the
speech act). Because P1's point of view may not correspond to reality, particular speech acts
may meet all preconditions and still be defective in the sense used in Searle [29]. The
preconditions that I am interested in are those that differ from one speech act to another.
For this reason, I will ignore general requirements such as the requirement that a given
speech act must be intended by P1.

Although the actual representation that I am assuming for the preconditions is OWL-1,
for the purposes of this paper I present the preconditions in informal English. Despite
the informality, some words are used in a restricted sense and several simple conventions
have been followed. I will discuss these conventions and some of the restrictions here, and
the rest of the restrictions will be explained as examples are presented.

15. Searle's essential conditions are not directly reflected in preconditions, although they seem
to be derivable from them. I see essential conditions as an amalgam of several different
types of information, some of which would be explicit in preconditions and the rest of which
would probably be implicit.


The first convention is that P1 in preconditions refers to the speaker/writer of the
speech act and P2 refers to the hearer/reader. Speech act conditions centered on intentions
of P1 (in the philosophical sense of intention) will have preconditions that correspond
directly, i.e.,

P1 <intention> <object of intention>

e.g., P1 wants some action done.

All other speech act conditions are recast with respect to P1's model of the world, i.e.,

P1 believes <speech act condition>

e.g., P1 believes that P2 can take responsibility for a (particular) action.


This takes care of the conventions. Two restrictions are also worth noting here, those on
want and believe. For both types of precondition, I have represented goals of participants by
the verb want, i.e., "P1 wants <state or action>" or "P1 believes that P2 wants <state or action>".
Believe is more complicated. As used in preconditions it is broader than know; that is, the
direct object of believe may be either a fact or an opinion. By "P1 believes X" I mean that
P1 has some world model that contains X and P1 believes that X corresponds to an aspect
of reality. By "correspond to an aspect of reality" I mean that actions based on X reach the
expected results or, if they do not, the deviation from the expectation is not caused by X.
As the philosophical literature attests, much more remains to be said about facts, opinions,
and reality. To avoid infinite -- or at least lengthy -- regress, I will cut off the discussion
here and appeal to the reader's intuitions.


The notion of a precondition is operationally useful in a computational theory. The
preconditions of a speech act are groups of conditions that are expected to obtain before the
speech act is initiated. As such, they should be intuitively obvious to speakers of the
language. One pair of linguistic tests for "preconditionhood" relates to the way that P2 may
claim that a failure has occurred in P1's speech act. For speech acts centered on intentions
of P1, P2 may reply with form 2.4. For others, form 2.5 is possible.


2.4 You don't really <intention>.
e.g., You don't really want me to do that.

2.5 What makes you think <direct object of initial "P1 believes">?
e.g., What makes you think I can tell you?

Note that these tests are necessary but not sufficient to distinguish the preconditions
particular to a speech act. They also apply to conditions associated with speech acts in
general. (See Section 6.3 for some of these general conditions.)

Several examples of the sets of preconditions associated with different speech acts are
found in the next section, and the usefulness of the concept of a precondition for ISA
analysis will become apparent in Section 6.
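
The two conventions for preconditions and the two linguistic tests (forms 2.4 and 2.5) can be sketched together. The following Python fragment is a minimal illustration, not the OWL-1 representation assumed in this paper: the class `Precondition`, its fields, and the method `challenge` are hypothetical names, and the content is assumed to be supplied already rephrased from P2's point of view.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Precondition:
    """One precondition of a speech act, seen from P1's model of the world."""

    kind: str             # "intention" (P1 wants ...) or "belief" (P1 believes ...)
    hearer_phrasing: str  # the content restated from P2's point of view

    def challenge(self) -> str:
        """Return the form P2 could use to claim the precondition has failed."""
        if self.kind == "intention":
            # Form 2.4: You don't really <intention>.
            return f"You don't really {self.hearer_phrasing}."
        if self.kind == "belief":
            # Form 2.5: What makes you think <direct object of "P1 believes">?
            return f"What makes you think {self.hearer_phrasing}?"
        raise ValueError(f"unknown precondition kind: {self.kind}")
```

As the text notes, passing such a test is necessary but not sufficient, so a sketch like this could only propose candidate preconditions, not confirm them.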




3. The Speech Acts Used in the Examples 


In this section we take a look at four commonly occurring speech acts. These
will be used as examples throughout the paper. The examples are drawn from two of
Searle's categories that are particularly relevant for task-oriented dialogue: directives and
commissives. (Representatives, also very important in task-oriented dialogue, are treated as a
class for reasons given below.)

The four speech act examples are presented here in terms of two additional categories,
which will be useful later on in the discussion of the ISA categories. (Recall the warning
that Searle's five categories were seen as one of several useful divisions of speech acts.) The
split that I am making is on types of propositional content. The two speech act classes will
be called information-centered and action-centered speech acts; they are defined in the two
subsections below. Note that the two categories do not cover all the speech acts in Searle's
five categories. While it would be possible to extend the alternative categorization scheme,
this is not necessary for the purposes of this paper.


3.1. Information-Centered Speech Acts 

Information-centered speech acts are those whose intended effect is the transfer of
information between P1 and P2 with no additional participants involved. Information-
centered speech acts include Searle's representatives and other types of speech acts such as
the directive ASK.¹⁶ For the purposes of this paper, I will need only ASK. While
representatives will play an important role in many aspects of the ISA analysis, I will not be
focussing on ISAs of individual representatives. The question of drawing distinctions
between individual representatives raises many interesting and important philosophical
issues, which are, unfortunately, beyond the scope of this paper.

16. Here and elsewhere, I will capitalize the names of speech acts that are used to refer to
the speech act definitions given in this section. Small letters will be used for references to
other definitions.

The propositional content and the preconditions of ASK are stated informally in
Figure 3.1. We have seen that ASK is a directive, i.e., an attempt by P1 to get P2 to take
responsibility for carrying out some action. In this case, the action desired is that P2 give
P1 information: the answer to a question.¹⁷ As I have defined ASK, it requires that P1
want to know an answer (precondition I). To be consistent, P1 must also believe that he or she
does not know the answer; this is part of the definition of want. The first precondition, then,
means that certain types of questions are not classed under ASK. Among these are test
questions and ceremonial questions where P1 already knows the answer.

The preconditions in Figure 3.1 should be self-explanatory, with the possible exception
of (V). The notion of obligation used here is a more specific version of the generalized
obligation that Labov and Fanshel use for requests in [14]. (I am using the same set of
obligations for REQUEST as for ASK; further discussion of the relationship between these
two speech acts is given below.) An example of an obligation arising from complementary
roles would be the obligation that applies in an interviewer/interviewee relationship.
Authority obligations are slightly more difficult to identify, since, especially in contemporary
American society, most authority arises from roles. Still, it is possible to identify authority
obligations that, while they may be carry-overs from roles adopted by the participants, do
not arise from current complementary role relationships. For example, if a teacher
encounters a student in the halls of the school and the teacher asks a question, the student is
obliged to answer by an authority obligation. Certainly, the teacher's authority can be seen
as arising from the role, but, in this context, the student need not actually be a student of
this teacher for the obligation to apply. Thus, the authority obligation stems from each
participant playing some role, but the role relationship is not complementary in the same
way that it is for teachers and students in a classroom situation.

17. I appeal to intuitive notions of question and answer here, although much more can --
and eventually should -- be said. It appears that a specification of the semantic structure of
questions and a full treatment of the semantics of answers will be sizable tasks.

Figure 3.1. The preconditions for ASK

Propositional content: some question

I. P1 wants to know an answer to the question.

II. P1 believes that P2 can tell an answer to the question.

III. P1 believes that P2 is willing to tell an answer to the question.

IV. P1 wants P2 to tell P1 an answer to the question.

V. P1 believes that P2 has some obligation (a role obligation, authority obligation,
or general obligation to be helpful) to P1 to tell P1 an answer to the question.

Comments

(I) Want here and in (IV) implies that P1 does not already know the answer. The case
where P1 does know and merely wants to know if P2 knows (and where P2 knows that P1
knows) is classed as a different speech act.

(I) Know is considered to be a restricted form of believe; while anything can be believed,
only facts can be known. (For ASK, the "fact" is that some proposition is the answer to the
question.) Fact, of course, needs its own definition, which I do not have space to go into
here; instead, I appeal to intuition and observe that facts are most strongly contrasted with
opinions.

(II)-(V) Tell is used here to mean "utter a representative."

(V) More than one of these different types of obligations may apply at once.
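
The preconditions for ASK in Figure 3.1 can be checked against a model of P1's beliefs and wants. The sketch below is illustrative only: `P1Model`, its field names, and `unmet_ask_preconditions` are hypothetical, and it makes the simplifying assumption that P1 wants an answer exactly when P1 does not already know one.

```python
from dataclasses import dataclass, field

# The three obligation types named in precondition (V).
OBLIGATION_TYPES = frozenset({"role", "authority", "be_helpful"})


@dataclass
class P1Model:
    """P1's beliefs relevant to ASK (hypothetical field names)."""

    knows_answer: bool
    believes_p2_can_tell: bool
    believes_p2_willing: bool
    believed_obligations: set = field(default_factory=set)


def unmet_ask_preconditions(m: P1Model) -> list:
    """Return the Figure 3.1 preconditions that fail, in numeral order."""
    # Simplifying assumption: "want" holds exactly when P1 lacks the answer.
    wants_answer = not m.knows_answer
    unmet = []
    if not wants_answer:
        unmet.append("I")
    if not m.believes_p2_can_tell:
        unmet.append("II")
    if not m.believes_p2_willing:
        unmet.append("III")
    if not wants_answer:
        unmet.append("IV")
    if not (m.believed_obligations & OBLIGATION_TYPES):
        unmet.append("V")
    return unmet
```

Under these assumptions a test question, where P1 already knows the answer, fails (I) and (IV) and so is not classed as an ASK, while asking a stranger for the time satisfies all five through the "be helpful" obligation.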

Winding up the discussion of obligations, we come to the general obligation to "be
helpful." By this I mean here an obligation to help further another individual's
goals. Examples of appeals to this obligation would be an ASK to a stranger for the time
or an ASK to a friend for the football scores. This type of obligation seems to arise simply
from the inequality of knowledge between the participants: if P2 knows something that P1
wants to know, then P2 is obligated to "be helpful" and share the knowledge. This
obligation is not absolute (nor are role or authority obligations), since it may be overridden
by other obligations.

To finish up the discussion of ASK, note that ASK shows strong similarities to
REQUEST, which is discussed in the next subsection. REQUEST is used primarily for
nonverbal actions. The distinction between ASK and REQUEST is based on the second
and third guidelines from Section 2.3.¹⁸ With respect to Guideline 2, ASK has the
interrogative as one of its direct forms (what I am calling its simple form; see Section 7.2).
REQUEST has only the imperative as a comparable form. With respect to Guideline 3, we
see differences in the patterns of action in which ASK and REQUEST take part. One
example is the following pair of responses:

3.1 I can't.

3.2 I can't tell you.

Here, 3.1 can be a response to a direct (i.e., imperative) REQUEST, but it cannot be used
synonymously with 3.2 as a response to a direct (i.e., interrogative) ASK.

18. The first guideline seems fairly inconclusive in this case. True, the SAV ask can be used
in the following two patterns: "asking someone something" and "asking someone to do
something." These two do not, however, correspond completely to a distinction between
describing an ASK and describing a REQUEST. For example, "I asked Harry if he would
lend me a pencil" may be used to report a REQUEST, and "I asked Harry to tell me the
answer" may be used to report an ASK.

All this is not meant to deny that ASK and REQUEST are closely related.
Comparison of Figure 3.4, below, with Figure 3.1 shows strong similarities in the
preconditions of the two speech acts. These strong similarities do not, however, necessarily
justify viewing ASK as a kind of REQUEST. Instead, the similarities can be seen as a
result of the common membership of ASK and REQUEST in the directive class.


3.2. Action-Centered Speech Acts

An action-centered speech act is defined as a speech act that has as its propositional
content an action A1 of a type to be specified where P1 wants A1 to occur. Here, A1 is either
non-verbal or a verbal action directed at someone other than P1.¹⁹

19. Besides the normal "active" actions I include as actions the maintenance of a state, e.g.,
being noisy, and the avoidance of an action, e.g., being quiet. The definition of action-
centered speech acts is meant to exclude verbal actions such as P1 asking P2 for information
but to include P1 asking P2 to give information to a third party, P3. The first case is
already covered by ASK, as defined above. I consider the second case to be action-centered
rather than information-centered because it is much closer to a nonverbal action: P2 must
not only evaluate his or her obligation to P3 vis-a-vis the information (P3's right to know)
but P2 must also evaluate his or her obligation to P1 to be the conveyer of the information
(P1's right to request). This level of complexity is closer to action-centered than to
information-centered speech acts. Note that my argument here is based on Guideline 3. I
am appealing to larger patterns of action to draw speech act boundaries, which then
percolate up into class boundaries. A case can also be made based on Guideline 2: the
simple interrogative form may be used for directives involving verbal actions with P1 as the
destination, but it may not be used for verbal actions with a third party as destination.

Action-centered speech acts, then, are acts that attempt to generate an action, not
merely refer to one (as in, for example, thanking someone for some action). Note that the
definition above does not specify that the intended effect be the principal one. The
implication of this is that action-centered speech acts are not restricted to the directives.
The action-centered speech act class also includes members of the commissive and
declaration classes where one of the intended effects is to cause one of the actions specified
above. The action-centered speech act class, then, is formed using "intended effect" as a
criterion and cuts across Searle's five major speech act classes.

The action-centered speech acts used as examples in this paper are REQUEST,
SUGGEST, and OFFER. I will summarize the major differences between them first and
then give the preconditions of each. The first difference is the nature of the action expected
of P2. For OFFER, the expected action is one of accepting another action. For
REQUEST, P2 is expected to take responsibility for carrying out an action desired by P1,
and, finally, for SUGGEST P2 is expected to consider a plan of action. A second difference
between the three speech acts lies in the obligations invoked. REQUEST depends on
appeals to one or more of the three types of obligation discussed for ASK, while OFFER
and SUGGEST depend on a combination of self-interest obligations and obligations to "be
helpful." The choice of obligation for OFFER and SUGGEST is discussed further below.
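
The two differences among REQUEST, SUGGEST, and OFFER summarized in this subsection (the action expected of P2 and the obligations invoked) can be laid out as a small table. The Python sketch below is only an illustration; the dictionary `ACTION_CENTERED`, its keys, and the helper `acts_appealing_to` are hypothetical names, and the obligation labels are shorthand for the obligation types discussed in the text.

```python
# Summary of the three action-centered example speech acts.
# Keys and labels are illustrative shorthand, not the paper's representation.
ACTION_CENTERED = {
    "REQUEST": {
        "expected_of_p2": "take responsibility for carrying out an action desired by P1",
        "obligations": {"role", "authority", "be_helpful"},
    },
    "SUGGEST": {
        "expected_of_p2": "consider a plan of action",
        "obligations": {"self_interest", "be_helpful"},
    },
    "OFFER": {
        "expected_of_p2": "accept another action",
        "obligations": {"self_interest", "be_helpful"},
    },
}


def acts_appealing_to(obligation: str) -> list:
    """Return, in alphabetical order, the speech acts that may invoke an obligation type."""
    return sorted(
        name for name, entry in ACTION_CENTERED.items()
        if obligation in entry["obligations"]
    )
```

A lookup by obligation type makes the contrast visible: only OFFER and SUGGEST draw on self-interest, while all three can appeal to the obligation to "be helpful."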


Taking each of the three example speech acts in turn, I start with OFFER. Although
promising is the favorite commissive in the speech act literature, OFFER will be considered
here because it has a rich ISA structure. It is also a speech act that is, in a sense, doubly
action-centered. An OFFER conveys, first, P1's desire to perform a service for P2 or to give,
lend, etc. something to P2. Second, OFFER conveys P1's desire that P2 accept the service,
gift, loan, or whatever. The preconditions given for OFFER in Figure 3.2 clearly reflect
this split.

The appearance of the "be helpful" obligation in OFFER (VII) needs explaining.
The concept that I am trying to express is P2's obligation to accept P1's OFFER. At first
glance, it seems enough to say that P2 has a default obligation to act in his or her own
self-interest (precondition VIII). This obligation, coupled with the fact that the propositional
content of an OFFER is an action expected to benefit P2, gives rise to an obligation for P2
to accept the OFFER. There seems to be more than simple pragmatism, however, involved
in accepting OFFERs. The obligation to accept is not merely P2's obligation to further his
or her own goals, but also P2's obligation to P1 to further P1's goals (i.e., P2's obligation to
be helpful, precondition VII). In accepting an OFFER, P2 is enhancing P1's image as a
benevolent person, P1's satisfaction in giving, etc. By accepting, then, P2 is furthering P1's
goals and being "helpful."

The next action-centered speech act that we come to is SUGGEST (Figure 3.3).
Recall the distinction above that SUGGEST is a directive that P2 consider an action, not
necessarily that P2 carry it out. Note that this treatment of SUGGEST is supported by the
common use of a response such as 3.3.


Figure 3.2. The preconditions for OFFER

Propositional content: some action that P1 believes will be of benefit to P2

I. P1 wants to take responsibility for the action.

II. P1 believes that P1 can take responsibility for P1's part of the action.

III. P1 is willing to take responsibility for P1's part of the action.

IV. P1 wants P2 to perform some action that complements P1's part of the action.

V. P1 believes P2 can perform some action that complements P1's part of the
action.

VI. P1 believes that P2 would be willing to perform some action that complements
P1's part of the action.

VII. P1 believes that P2 has an obligation (to "be helpful") to P1 to perform some
action that complements P1's part of the action.

VIII. P1 believes that P2 has an obligation to P2 (by virtue of P2's own self-interest)
to perform some action that complements P1's part of the action.

Comments

(I)-(III) "Take responsibility" is used here and elsewhere to permit subcontracting. Whether
P1 actually performs the action or not, he or she still remains responsible for the results.

(IV)-(VIII) Examples of complementary actions would be (physically) taking food OFFERed
by a hostess or getting into a car and sitting in response to an OFFER of a ride from a
friend. A general way to refer to P2's performance of a complementary action in response to
an OFFERed action is to say that P2 accepted, e.g., "Jane thanked Paula and accepted the
gift."


Figure 3.3. The preconditions for SUGGEST

Propositional content: an action,
except for: actions in which P1 and P2 share joint agency

I. P1 wants P2 to consider taking responsibility for the action.

II. P1 believes that P2 can consider taking responsibility for the action.

III. P1 believes that P2 is willing to consider taking responsibility for the action.

IV. P1 believes that P2 has an obligation (to "be helpful") to P1 to consider the
action.

V. P1 believes that P2 can take responsibility for the action.

VI. P1 believes that P2 is willing to take responsibility for the action.

VII. P1 believes that there are some reasons why the action is desirable.

VIII. P1 believes that P2 has an obligation to P2 (by virtue of P2's own self-interest)
to consider taking responsibility for the action.

Comments

(propositional content) In excluding actions where P1 and P2 share common agency, I am
merely arguing for a separate speech act, e.g., SUGGEST-COMMON-ACTION, to cover
such cases. I justify this distinction by an appeal to Guideline 2: this second speech act has
some forms not shared by simple SUGGEST, e.g., "Let's <action>."

3.3 That's a good idea.

Note that the action to be considered by P2 may be a verbal action directed at P1. For
example, to help P2 solve a problem, P1 might SUGGEST 3.4.

3.4 Why don't you tell me what you've done so far.

The unrestricted nature of the action suggested does not, however, threaten the status of
SUGGEST as an action-centered speech act. This is because the action in the propositional
content is that P2 consider the second action, and considering something is a nonverbal
action.

The obligation preconditions for SUGGEST (IV and VIII) are similar to those for
OFFER. Once P2 has recognized that an action is desirable (precondition VII), there is
a default obligation that P2 consider taking responsibility for carrying out the action (VIII).
SUGGEST also appeals to P2's obligation to P1 to help further P1's goals (e.g., enhanced
self-esteem and recognition for P1). The obligation arises from the fact that a goal of P1 is
involved in a SUGGEST (precondition I). This obligation is generally not as prominent for
SUGGEST acts as for OFFERs, but it still plays a role. Consider, for example, the
politeness constraints involved in responding to a SUGGEST. A response such as 3.5
violates the obligation in (IV).

3.5 That's a terrible idea.


The third action-centered speech act that will be considered here is REQUEST, the
directive by which P1 attempts to get P2 to take responsibility for carrying out some non-verbal
action or verbal action not directed at P1. One other class of actions that I am
currently excluding from REQUESTs is remembering previous REQUESTs; that is to say,
REQUEST as I define it does not include reminders. This can be justified by an appeal to
Guideline 1 due to the difference in SAVs request and remind as well as by an appeal to
Guideline 3. (Reminders are often followed by apologies or thanks by P2, where simple
REQUESTs are not.) Figure 3.4 lists the preconditions for REQUEST.

In defining REQUEST, I follow Labov and Fanshel in citing an obligation. Just as
for ASK, I further specialize the notion of obligation into authority, complementary role,
and “be helpful” obligations.20 Another point that is worth mentioning is that my treatment
of obligation subsumes the notion of P1's right to invoke the obligation. (See [14], p.78.) The
obligation stated for REQUEST and ASK is a three-place relationship between P1, P2, and
the thing that P2 is obliged to do. P1 may be part of a larger set, so that the particular
obligation may be to both P1 and the society in general, e.g., the obligation to drive
carefully. Given this formulation, P1 has the right to invoke the obligation because P1 is
one of the parties to the obligation.


This takes care of the speech act examples that will be used in this paper. There is
one more piece of foundation that must go into place before the taxonomy of ISAs can be
presented. Accordingly, the next section presents a general representation scheme for
actions.

20. For REQUEST and, to a lesser degree, for ASK, I have been tempted to define three
different speech acts, one for each type of obligation. While there is some Guideline 2
(difference in surface form) evidence for this, the bulk of the argument would have to rest
on Guideline 3 (action pattern) evidence. The modelling of dialogue patterns is at too early
a stage for Guideline 3 evidence taken alone to be conclusive, so I stay with the more
standard treatment of REQUEST.




Figure 3.4. The preconditions for REQUEST

propositional content: a non-verbal action or a verbal action whose destination is not P1,
except for: the action of remembering a previous REQUEST

I. P1 wants P2 to take responsibility for carrying out the action.
II. P1 believes that P2 can take responsibility for carrying out the action.
III. P1 believes that P2 is willing to take responsibility for carrying out the action.
IV. P1 believes that P2 is obligated to P1 (and possibly to others) to take
responsibility for carrying out the action. (This obligation may be a role obligation,
an authority obligation, or a general obligation to be helpful.)

Comments

(I) “Want” implies that P1 believes that the action has not already been carried out.

(III) At least for the purposes of reasoning, “willing” here includes P2 not being grossly
unwilling.

(IV) More than one of these different obligations may apply at once.
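The preconditions in Figure 3.4 can be read as a checklist over P1's beliefs. The following Python sketch is one hypothetical encoding, not part of the original analysis: the class, field, and function names are assumptions introduced for illustration, and the propositional content restriction is left out.

```python
from dataclasses import dataclass
from enum import Enum


class Obligation(Enum):
    ROLE = "role"
    AUTHORITY = "authority"
    BE_HELPFUL = "be helpful"


@dataclass
class RequestBeliefs:
    """P1's attitudes toward one candidate REQUEST (illustrative names)."""
    p1_wants_action: bool                  # precondition I
    p2_can_do: bool                        # precondition II
    p2_grossly_unwilling: bool             # comment (III): "willing" = not grossly unwilling
    obligations: frozenset = frozenset()   # precondition IV; several may apply at once
    already_done: bool = False             # comment (I): "want" implies not yet carried out


def failed_preconditions(beliefs):
    """Return the numerals of the Figure 3.4 preconditions that do not hold."""
    failed = []
    if not beliefs.p1_wants_action or beliefs.already_done:
        failed.append("I")
    if not beliefs.p2_can_do:
        failed.append("II")
    if beliefs.p2_grossly_unwilling:
        failed.append("III")
    if not beliefs.obligations:
        failed.append("IV")
    return failed
```

Keeping the obligations as a set reflects comment (IV): a role obligation and a general obligation to be helpful may both hold for the same REQUEST.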




4. Representing Procedures: The OWL-I Method


In this section I discuss a general representation scheme for actions and patterns of
actions. The discussion will add new perspectives to the treatment of taxonomy developed
in the last two sections. More important for my purposes here, this section will also form
the foundation of the analysis of ISAs that is presented in Sections 6 and 7. Note that the
scheme for representing actions described here is not the only one that has been proposed.
I defer a discussion of some of the alternatives until Section 8.2, where a comparison of
static representations can be combined with a look at the way that the representations are
used in language processing.

I start the discussion of patterns of action with OWL-I, which is a formalism for
representing knowledge. The basic unit of OWL-I is called a concept, and concepts are
related to each other by a network of indices. The important point about concepts for this
paper is that they can be grouped into larger structures. The structure of interest here is
called a method.21 Methods provide a high level, relatively declarative, representation
for procedural knowledge. The high level nature of the representation means that methods
can be used as the basis of an explanation capability (see Swartout). Moreover, this
property in combination with the highly interrelated nature of OWL-I concepts means that
method structures can be a very significant aid in searches of a knowledge base. My


discussion of methods will avoid issues of notation. English translations are used for
examples, and the reader interested in OWL notation is referred to Hawkinson [13] and
Brown [3].

21. OWL methods were initially defined by William A. Martin. OWL-I is the version of OWL
that was used in the dialogue project; OWL has continued to develop since that time, and
OWL-II is the current version. For an introduction to OWL-I, see Szolovits et al. [33].

With respect to content, methods are used to represent both linguistic and
non-linguistic actions. For example, they can be used to represent an action such as writing a
computer program for someone else, which normally contains both linguistic and
non-linguistic steps. Methods can also be used to model strictly non-linguistic activities such as
baking a cake.22 In the examples I have considered and in the rest of this paper, however, I
will focus on what will be called core dialogue methods. This is a group of semantic domain
independent methods built around speech acts. (Core methods may, however, contain some
non-linguistic steps, e.g., to represent mental processes.) An example of a core method is
request-and-respond, which appears in an English translation in Figure 4.1. (The details of
this example will be discussed below.) English translations of some other core methods can
be found in Appendix A.

Implicit in the preceding discussion is the point that methods may have one
participant or several. The core dialogue methods model the parts played by both
participants in the dialogue, a property I call speaker independence. Methods can therefore
be used either from the point of view of a non-participant, to model both parts heard (or
read), or from the point of view of a participant, in which some utterances are generated
and some understood.

Looking at the structure of methods in more detail, we start with the three main parts:
a header, argument specifications, and a procedural body. The header is the method's


22. See Long [17] for the use of OWL-I methods to model programming knowledge.


REQUEST-AND-RESPOND

OBJECT: an action that can be the object of REQUEST
    exceptions: 1. a helping action 2. a repetitive action
AGENT: a person or computer system
CO-AGENT: a person or computer system

method:

1. The AGENT REQUESTs the action of the CO-AGENT.
2. The CO-AGENT now knows the action desired.
3. The CO-AGENT checks to see whether beliefs from the preconditions
   of the REQUEST are justified.
4. The CO-AGENT acknowledges the REQUEST.
5. The CO-AGENT, at the appropriate time, takes responsibility for
   carrying out the action.
6. PRINCIPAL-RESULT: Any results of the action desired obtain.

recovery path 1: if the CO-AGENT can't do what the AGENT says --
    If the REQUEST is based on a role obligation and the action
    REQUESTed is not in one of the CO-AGENT's roles with respect to
    the AGENT --
    R1.1 The CO-AGENT says the action is not in his or her role.
    R1.2 If the CO-AGENT wants to be helpful
         and
         if the CO-AGENT knows a likely participant to take
         responsibility for the action desired
         then
         the CO-AGENT refers the AGENT to this likely participant.

recovery path 2: if the REQUEST was framed in a general way
    and the CO-AGENT can only do a more specific version --
    R2.1 The CO-AGENT describes the specific version that he or she
         can do.
    R2.2 If the more specific description matches the AGENT's goal
         then
         the AGENT says that is what he or she wanted
         and
         REQUEST-AND-RESPOND continues using the new description
         otherwise
         dialogue failure: the CO-AGENT can't do what the AGENT wants.

Figure 4.1. An English representation of a method for requesting an action
and getting a response




unique name. Argument specifications, organized by semantic cases (see below), are used for
type checking of inputs to the method (input cases) or to specify the form of results (output
cases). The procedural body is divided into two parts: optional prerequisites and
procedure steps. The prerequisites of interest here are those that are states. A stative
prerequisite of an action is a condition that must obtain before that action is carried out. If
the condition does not hold, then one must bring it about before carrying out the action.

The other part of the procedural body is the set of procedure steps, which come in 
two basic varieties, standard path and recovery path. This distinction will be discussed 
further below, but, basically, standard paths represent the ways that an exchange can “go 
right,” while recovery paths give some of the possible measures to be taken when an 
exchange gets “off the track.” A recovery path may be initiated when an expectation set up 
by an ongoing method is violated.
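The parts of a method described above (header, argument specifications, and a procedural body made up of prerequisites plus standard path and recovery path steps) can be pictured as a simple record. The Python sketch below is only an illustration under that reading; it is not OWL-I notation, and the `Method` class and its field names are assumptions.

```python
from dataclasses import dataclass, field


@dataclass
class Method:
    header: str                                          # the method's unique name
    input_cases: dict                                    # semantic case -> expected type
    output_cases: dict = field(default_factory=dict)     # semantic case -> form of result
    prerequisites: list = field(default_factory=list)    # optional stative prerequisites
    standard_path: list = field(default_factory=list)    # steps when the exchange goes right
    recovery_paths: dict = field(default_factory=dict)   # violated expectation -> steps


request_and_respond = Method(
    header="request-and-respond",
    input_cases={
        "OBJECT": "an action that can be the object of REQUEST",
        "AGENT": "a person or computer system",
        "CO-AGENT": "a person or computer system",
    },
    output_cases={"PRINCIPAL-RESULT": "any results of the action desired"},
    standard_path=[
        "The AGENT REQUESTs the action of the CO-AGENT.",
        "The CO-AGENT now knows the action desired.",
        "The CO-AGENT checks the preconditions of the REQUEST.",
        "The CO-AGENT acknowledges the REQUEST.",
        "The CO-AGENT takes responsibility for carrying out the action.",
        "PRINCIPAL-RESULT: any results of the action desired obtain.",
    ],
    recovery_paths={
        "action not in the CO-AGENT's role": ["R1.1", "R1.2"],
        "REQUEST framed too generally": ["R2.1", "R2.2"],
    },
)
```

The prerequisite list is left empty here because request-and-respond has none; the field is kept so that other kinds of action can use it.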

Since REQUEST is the speech act from which I will draw most of the examples, I
will use the request-and-respond pattern to illustrate the discussion of methods. In Figure
4.1, request-and-respond corresponds to the header of the actual OWL-I method.
Request-and-respond takes three input cases -- OBJECT, AGENT, and CO-AGENT -- and one
output case, PRINCIPAL-RESULT.23 AGENT and CO-AGENT are the two participants
in the method; by convention, the AGENT of an entire dialogue method is the AGENT of
the first step, so that we can identify the AGENT of the method with the participant who
starts it off. The OBJECT of request-and-respond is the action desired by the AGENT.24
Input case specifications typically contain variables, which are like other concepts except
that their composition follows special rules. The fact that other concepts may be bound to
variables permits representations to be evaluated with respect to some environment. (This is
important when methods are used in recognition; see Section 9.) One property of input case
specifications omitted from Figure 4.1 is their use as a place to associate further entry
conditions, i.e., conditions that must be met before the method is applied. This point is
discussed further below.

23. Semantic case names will be capitalized to avoid confusion with the normal use of these
words. There should be no confusion with speech act names, which I am also capitalizing.

Note that input cases are associated with methods, not surface English verbs. A list of
the major OWL-I input cases, with informal explanations, was written by William A. Martin
and is reproduced with minor changes in Figure 4.2. I refer to these as the major cases
because I do not see Figure 4.2 as an exhaustive list. There will be a significant number of
inputs to methods that this list does not cover. Many of these can be fitted into the basic
framework as more specialized versions of the major semantic cases. It appears, however,
that some actions have totally idiosyncratic inputs which will have to be treated as special
cases (in both senses of case).

Besides input case specifications, I said that OWL-I methods may have associated
output case specifications, i.e., results. One important notion here is that of principal result,
which is the main result of the method and, typically, the reason that the method is


24. The OBJECT of request-and-respond excludes helping actions, i.e., those in which the
AGENT and CO-AGENT divide responsibility for the action. These are represented in
another core method, request-and-help (see Appendix A). Also excluded are repetitive
REQUESTs, e.g., a standing order for publications. More effort must be put into
bookkeeping so that the repetitive REQUEST can be responded to repeatedly. The steps
differ enough to warrant the use of a separate method.


Figure 4.2. The major OWL-I input cases

OBJECT
    This is the input to the method that is most important to the nature of the process.
    For example, the process of baking bread is similar no matter who does it, but baking a
    souffle is a very different process from baking bread.
    ex. Make a statue from clay.
        Empty the boat of water.

AGENT
    The AGENT is the participant responsible for the action.
    ex. I hit the ball.

CO-AGENT
    This is someone who has a responsibility level equal to that of the AGENT for some
    part of the action.
    ex. Harry helped me move the couch.

BENEFICIARY
    This is an individual who receives the benefit of the action without major active
    participation in it.
    ex. Answer the phone for John.

SOURCE
    The SOURCE describes a former position or state of some other case-filler in the
    action (usually the OBJECT).
    ex. Take a block from the box.

DESTINATION
    The DESTINATION is to the future as the source is to the past. It is a point, state,
    etc. toward which some other case-filler in the action (usually the OBJECT) is tending.
    ex. Tell it to the judge.

TRAJECTORY
    This is the path taken by some other case-filler in the action.
    ex. Run across the street to the store.

DIRECTION
    This is the direction taken by some other case-filler in the action.
    ex. Head south across the desert to Mexico.

SPECIFIC-LOCATION
    This is a location associated with the action. Some activities like “ride” take a
    location which is quite specific to it; there are only certain things which can be ridden in.
    ex. Ride in a dog sled in Alaska.

INSTRUMENT
    The INSTRUMENT is something used as a tool in an action; it is left over
    afterward.
    ex. Cut the butter with a knife.

SPECIFIC-RAW-MATERIAL
    This is something used and consumed in the course of the action.
    ex. Bake a cake with powdered eggs.

TOPIC
    This case is used for the topic of mental and communication activities.
    ex. Talk about a bear.

EXCHANGE
    This is something received in exchange for another case-filler in the action.
    ex. Trade seeds for food.

RATE
    This case gives a measure of speed, cost, etc. in relation to units or to something else.
    ex. Rent a room at three dollars a day.




undertaken. For example, the action conveyed in "Paint the block red" has as principal
result that the block is red. The paint brush may also end up red, but this is not the
principal result. In Figure 4.1, request-and-respond has its PRINCIPAL-RESULT output
case set to the results of the action desired by the AGENT. When the method is
successfully completed, the action has occurred and its results obtain.

This takes care of the header and argument specifications; we turn now to the
procedural body, the first component of which is the prerequisite. The request-and-respond
example has no prerequisite. In general, few have been used in the representation of core
dialogue methods. Nevertheless, prerequisites are a useful structure for other sorts of actions.
A familiar example of a prerequisite is the requirement that an elementary course of study
be completed before a more advanced one is undertaken. Note that prerequisites differ
from other entry conditions in that, if a prerequisite is not satisfied, a method is not rejected
as a possible course of action. Instead, measures are taken to satisfy the prerequisite, and the
execution of the method then continues.
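The contrast between prerequisites and other entry conditions can be made concrete with a small sketch. The Python function below is a hypothetical illustration only: conditions are modelled as plain strings in a set of facts, and the name `attempt` and its return conventions are assumptions.

```python
def attempt(entry_conditions, prerequisites, steps, state):
    """Try to apply a method in a state (a set of facts that currently hold).

    Returns None if the method is rejected, otherwise the list of things done.
    """
    # A failed entry condition rejects the method as a course of action.
    if not all(condition in state for condition in entry_conditions):
        return None
    trace = []
    # A failed prerequisite does not reject the method: it is brought about first.
    for prerequisite in prerequisites:
        if prerequisite not in state:
            trace.append("bring about: " + prerequisite)
            state.add(prerequisite)
    # Execution of the method then continues with its procedure steps.
    trace.extend(steps)
    return trace
```

With the course-of-study example, an unsatisfied prerequisite adds a preliminary action, while an unsatisfied entry condition (here an invented "enrolled" fact) rules the method out altogether.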

The second part of the procedural body is the set of procedure steps. These are
either calls to other OWL-I methods used to model subactions, assertions of states, or calls to
“black box” procedures. The last are procedures represented in a conventional
programming language (e.g., LISP); these are intended for complex numeric or linguistic
computations, or for any procedures whose steps are not to be included explicitly in the
model. (We will see a use of “black box” representations below.)


Recall that procedure steps were divided into standard path and recovery path
steps.25 I will say something more about each. First, standard paths are the normally
expected sequences of events for an action. They embody the relatively small number of
ways that an action can be concluded successfully, i.e., can be concluded such that any
associated goals are met. When an English description of an action is given, it is the
standard path steps that are typically included. If deviations from the standard path(s) are
described at all, it is only the most important ones that are given. This indicates that the
steps in the standard paths of an action embody the minimum of necessary information
about it. For example, when giving someone directions for getting somewhere, one normally
describes only the correct routes. In the normal case, one does not discuss all the many
ways someone could get back on the route again after having made a wrong turn. Even
when a particular mistake is very common or particularly costly, one usually describes how
to avoid it (e.g., "Don’t follow the signs") rather than how to recover from it.

To decide which steps belong to standard paths, one can ask whether it would
normally be necessary to describe a step when describing the action, or whether it would be
necessary to describe some step of which it is a substep. (This second case applies to
low-level detail left out because it is either obvious or unnecessary.) Another useful criterion is
whether the absence of a step is seen as an event. (This criterion is similar to the one used
by Schegloff and Sacks in [28] to determine what should go in a sequence.) For example, if
someone asks a question and does not get an answer, this is generally worthy of note. If one
asks a question and the hearer does not ask for clarification, this absence of discussion is
not, in most cases, considered remarkable. The standard path steps, then, are the normally
expected sequences of events for an action, those whose non-occurrence is normally
considered an event.

25. This is not quite the whole story. A third type of step is the assertion of a failure
condition. Failure conditions are represented as disjuncts off standard paths or recovery
paths.
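The "absence as an event" criterion can be sketched as a comparison between the steps a standard path leads one to expect and the steps actually observed. The Python fragment below is a toy illustration; the function name and the step labels are assumptions, not part of the method notation.

```python
def noteworthy_absences(standard_path, observed):
    """Return the standard path steps that did not occur.

    Only steps on the standard path can be missed in this sense, so the
    non-occurrence of anything else is never reported.
    """
    return [step for step in standard_path if step not in observed]


# Hypothetical standard path for asking a question.
ASK_AND_ANSWER = ["question asked", "answer given"]
```

Because a request for clarification is not on this standard path, its absence is never flagged, whereas a missing answer is.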

It is fine to follow standard path steps as long as the action goes as intended and no
expectations are violated. In practice this will probably not be long, and a failure, with its
attendant recovery actions, will result.26 Some recovery actions are very general and should
be modelled by independent methods. One type of general example is the recovery from a
failure in the basic conditions of a dialogue, e.g., noise on the line in a telephone
conversation. Many other failures, however, are quite specific to the task being carried out.
Recovery from such failures is modelled by recovery paths in relevant methods. Recovery
paths handle domain specific types of failure recovery, and one of their advantages is that
they allow the modeller to put in special instructions for doing what is necessary to return to
an appropriate standard path.

The standard path steps for the request-and-respond method are steps 1 through 6 in
Figure 4.1. Two recovery paths are also shown for request-and-respond, although many
more exist to be modelled. The first recovery path in Figure 4.1 could be used to account
for a response such as 4.1, and the second could account for the exchange in 4.2, responding
to a REQUEST to write a computer program.

4.1 That's not my department. Try Mr. Jones down the hall.

4.2 CO-AGENT: I can only write blocks world programs.
    AGENT: That's what I want.


26. For my purposes here, I define failure as the violation of an expectation. This is
slightly broader than its normal usage, and I do not intend for the usual negative
connotations to accompany it.


The failures that trigger both of these recovery paths occur in the process of checking the
preconditions, step 3, but the recovery paths are associated with the request-and-respond
method. Different recovery paths for these failures could be used when a
precondition-checking method is called from another method, e.g., request-and-help. Note that the second
recovery path itself has an associated failure condition, which would justify the inclusion of
an additional recovery path in the appropriate method. Although only the recovery paths
in Figure 4.1 contain conditional steps, note that such conditionals can occur in standard
paths as well.
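The interplay of the standard path and the two recovery paths can be traced with a short sketch. The Python code below is a simplified, hypothetical rendering of Figure 4.1: the function, its parameter names, and the string labels are assumptions, and the point at which the method rejoins the standard path after recovery path 2 (step 4 here) is also an assumption.

```python
STANDARD_PATH = [
    "1. AGENT REQUESTs the action of the CO-AGENT",
    "2. CO-AGENT knows the action desired",
    "3. CO-AGENT checks the preconditions of the REQUEST",
    "4. CO-AGENT acknowledges the REQUEST",
    "5. CO-AGENT takes responsibility for the action",
    "6. PRINCIPAL-RESULT: results of the action obtain",
]


def request_and_respond(violation=None, refers_onward=False, specific_version_ok=False):
    """Return the steps taken, given an optional failure detected at step 3."""
    trace = STANDARD_PATH[:3]  # both failures arise while checking preconditions
    if violation is None:
        return trace + STANDARD_PATH[3:]
    if violation == "not in role":  # recovery path 1
        trace.append("R1.1 CO-AGENT says the action is not in his or her role")
        if refers_onward:
            trace.append("R1.2 CO-AGENT refers the AGENT to a likely participant")
        return trace
    if violation == "too general":  # recovery path 2
        trace.append("R2.1 CO-AGENT describes the specific version he or she can do")
        if specific_version_ok:
            trace.append("R2.2 AGENT says that is what he or she wanted")
            return trace + STANDARD_PATH[3:]  # return to the standard path
        return trace + ["dialogue failure"]
    raise ValueError("no recovery path for: " + violation)
```

Calling `request_and_respond("not in role", refers_onward=True)` yields a trace that corresponds to response 4.1, and `request_and_respond("too general", specific_version_ok=True)` corresponds to exchange 4.2.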

One final point that Figure 4.1 illustrates about dialogue methods is their abbreviated
form. For any given speech act, only production is represented explicitly, and the
understanding process carried out by the other partner is left implicit. For example, the
first step of request-and-respond is that the AGENT makes the REQUEST, but no explicit
step is given to represent the process of the CO-AGENT understanding the REQUEST.
Thus, although the methods are speaker independent in the technical sense defined, they
are not without a bias: it does not matter who is specified as the AGENT of a speech act
step, but, whoever is, the “story” is told from his point of view. “Listening” steps are left
implicit not because they are unimportant, but because the form and timing are predictable.
Where a joint model of communication is necessary (e.g., when misunderstandings occur), it
is necessary to expand the abbreviated model expressed in the dialogue methods.

This takes care of the request-and-respond example and with it the general
description of methods. A few words remain to be said to link methods to the treatment of
speech acts described in Section 2. First, note that speech acts such as REQUEST have
their own associated methods. This is in accord with the major thrust of speech act theory,
which views utterances not merely as saying but also as doing. The speech act methods
have semantic case specifications for an OBJECT, AGENT, and DESTINATION.27 Here,
the OBJECT specification generally corresponds to the propositional content condition
identified by Searle. For all speech act methods, the procedure steps are represented by a
call to a “black box” procedure for generating utterances. The preconditions of speech acts
are represented in the methods as entry conditions associated with the AGENT of the
speech act. Preconditions also appear, possibly somewhat restricted, as constraints on the
AGENT of the higher level core methods built around the speech acts.

Turning to speech act taxonomy, recall that I adopted the principal intended effect of
the speech act as a major taxonomizing principle. This notion of intended effect can be
related to core methods via the OWL-I output case PRINCIPAL-RESULT. More
specifically, the principal intended effect of a speech act corresponds to the action directly
precipitating the PRINCIPAL-RESULT(s) of a core method built around the speech act.28
For the request-and-respond example, the principal intended effect is step 5, whose results
become the PRINCIPAL-RESULT of the core method.29 We can say, then, that core
methods accommodate the taxonomizing principle used for speech acts in Section 2.

In summary, methods provide a general representation for actions. In this section, I
have emphasized the way that speech acts can be integrated into larger patterns of action,
using methods as a representation. In Section 7, method structure will be used in
formulating an important class of rules for ISAs.

27. DESTINATION is used rather than CO-AGENT because the second participant in a
speech act is seen as passive. This does not mean that this participant does nothing, since
he or she is doing the work of trying to understand the utterance. By passive I merely mean
that the second participant is basically receiving the information, rather than playing a
major role in shaping it.

28. Where a speech act has more than one associated core method, the principal intended
effect will be a generalization that covers the different actions that directly precipitate the
PRINCIPAL-RESULT(s).

29. Note that principal intended effects are often, but not always, the last step in a core
method. For example, we can construct a core method called state-and-acknowledge,
consisting basically of a speech act of stating (done by P1) followed by an understanding
and acceptance operation (done by P2). This latter step would be the principal intended
effect of the speech act, but it is not customarily the last step in the core method. Instead, the
last step of state-and-acknowledge is typically an acknowledgment (done by P2).




5. Three Approaches to the Treatment of Indirect Speech Acts 


All of the foundation is in place, and we can now turn our attention to ISAs. To
start, I briefly outline three approaches to the treatment of ISAs that have been prominent
in the linguistics literature. Too much has been written on this topic for me to attempt a
comprehensive summary here, and I intend only to use these approaches to motivate the
analysis in Sections 6 and 7.


5.1. Gordon and Lakoff

The first approach of interest here is that taken by Gordon and Lakoff [10].
Concentrating primarily on requests, Gordon and Lakoff propose a set of four sincerity
conditions30 and then give a single powerful rule to account for the different ways that a
request can be framed. They say that to make a sincere request, a speaker must want the
action done and believe that the hearer can do the action, that he wants to do the action,
and that he would not do it if he were not asked to. The first of these sincerity conditions
is called speaker-based and the next three are called hearer-based. The rule given is:

One can convey a request by (a) asserting a speaker-based sincerity condition or
(b) questioning a hearer-based sincerity condition.
This formulation is very attractive because it is so elegant and simple, but it has
encountered extensive criticism. Sadock [24] points out that Gordon and Lakoff's rule is
too powerful, admitting utterances that are not requests. For example, 5.1 is not a request for
the hearer to move over, even though the similar form 5.2 is.

5.1 Tell me if you can move over.

5.2 Can you move over?

Even if the set of sincerity conditions could be recast to generate only the correct forms,
there are still facts that cannot be accounted for by an approach that uses only speech act
conditions plus a general rule. Sadock gives the following examples:

5.3 Can you close the door?

5.4 Are you able to close the door?

5.5 Can you please close the door?

5.6 Are you able to please close the door?

Sentences 5.3 and 5.4 can both be used as requests, and in this use they are paraphrases of
each other. Taking Gordon and Lakoff's approach, both 5.3 and 5.4 would therefore be
derived from the same sincerity condition. Sentence 5.6, however, is ungrammatical, and
any scheme that derives 5.3 and 5.4 uniformly will be hard pressed to account for the
difference between 5.5 and 5.6.

30. It is not clear whether Gordon and Lakoff mean something different by sincerity
condition than Searle does. At any rate, their choice of sincerity conditions for requesting
differs from Searle's.
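Gordon and Lakoff's rule, and the reason it cannot separate 5.3 from 5.4, can be sketched as a small generator. The Python code below is a hypothetical illustration only: the data structure, the function name, and in particular the English templates attached to each condition are assumptions, not forms taken from Gordon and Lakoff.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class SincerityCondition:
    name: str
    based_on: str      # "speaker" or "hearer"
    assertions: tuple  # declarative templates stating the condition
    questions: tuple   # interrogative templates querying the condition


CONDITIONS = (
    SincerityCondition("speaker wants the action done", "speaker",
                       ("I want you to {vp}.",), ("Do I want you to {vp}?",)),
    SincerityCondition("hearer can do the action", "hearer",
                       ("You can {vp}.",), ("Can you {vp}?", "Are you able to {vp}?")),
    SincerityCondition("hearer wants to do the action", "hearer",
                       ("You want to {vp}.",), ("Do you want to {vp}?",)),
    SincerityCondition("hearer would not act unless asked", "hearer",
                       ("You will {vp}.",), ("Will you {vp}?",)),
)


def conveyed_requests(vp):
    """Map each surface form licensed by the rule to the condition it derives from."""
    forms = {}
    for condition in CONDITIONS:
        # (a) assert a speaker-based condition; (b) question a hearer-based one.
        templates = condition.assertions if condition.based_on == "speaker" else condition.questions
        for template in templates:
            forms[template.format(vp=vp)] = condition.name
    return forms
```

In this sketch "Can you close the door?" and "Are you able to close the door?" are both derived from the same condition, so nothing in the rule itself can distinguish how the two forms behave with please.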


5.2. Sadock 

To answer the objections that he raises against Gordon and Lakoff's theory, Sadock
in [24] distinguishes between utterances that have a speech act as their meaning and
utterances that mean one speech act but entail another. Thus, according to Sadock, 5.3 is a
sort of idiom which entails the notion of request as part of its meaning, while 5.4 is a
question according to its meaning, only secondarily entailing a request. Sadock's claim is
that, although rules based on sincerity conditions have their place, they cannot account for
the behavior found in speech act idioms.

Sadock's observations seem to be well taken, although his distinction between
meaning and entailment appears to me to be ultimately misleading. The difficulty, I think,
is that the notion of what a meaning is and what information it should contain are not as
well understood as the theory implies. Sadock assumes that each sentence has only one
semantic representation that constitutes its meaning, and he himself notes that this makes
him unable to account for the deliberate use of ambiguity in conversation.

There is other evidence that Sadock's treatment falls short of a precise
characterization of ISAs. Note that some sentences treated by Sadock as speech act idioms
have conversational properties based on their surface forms. Consider examples 5.7 and 5.8
used as requests.

5.7 Move over.

5.8 Can you move over?

5.9 OK.

5.10 I don't know if I can.

5.11 Yes.

5.12 I don't know.

Examples 5.9 through 5.12 are possible responses to 5.8, but only 5.9 and 5.10 are appropriate
as responses to 5.7. The indirect request form 5.8, then, can be treated as either a request or
a question by the hearer for the purpose of response. Note, however, that 5.8 can still be
understood as a request even though it is treated as a question. One may respond with
"Yes" and then go on to comply with the request. In fact, to respond "Yes" and then fail to
comply indicates either deliberate rudeness or failure to understand the request force of the


utterance. If we follow Sadock and treat 5.8 as an idiom when used as a request, then only
the request meaning is available, and it is then impossible to account for the response
behavior described.
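The response facts for 5.7 through 5.12 can be summarized by letting an utterance carry more than one force for the purpose of response. The Python sketch below is a toy illustration; the two tables and the function name are assumptions introduced here, populated only with the examples from the text.

```python
# Responses appropriate to each force, taken from examples 5.9 through 5.12.
RESPONSES = {
    "request": ["OK.", "I don't know if I can."],
    "question": ["Yes.", "I don't know."],
}

# Forces an utterance can be treated as having when the hearer responds.
FORCES = {
    "Move over.": ["request"],
    "Can you move over?": ["request", "question"],
}


def possible_responses(utterance):
    """Collect the responses licensed by every force the utterance can carry."""
    responses = []
    for force in FORCES[utterance]:
        responses.extend(RESPONSES[force])
    return responses
```

If "Can you move over?" were listed with the request force alone, as a pure idiom analysis would have it, the function would wrongly return the same two responses for both utterances.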

Another phenomenon that Sadock's theory does not account for is the variation in the
indirect forms that are possible for “entailed” requests. For example, Sadock considers
questions of the “Are you able to ...” form to entail requests. He also observes that questions
of the form "Do you know <interrogative clause>?" can have the illocutionary force (and
hence in his scheme the meaning) of the interrogative clause. That is, 5.13 is often
equivalent to 5.14 rather than to a question about the knowledge of the hearer.

5.13 Do you know what time it is?

5.14 What time is it?

Putting these two facts together, Sadock's theory would predict that 5.15 should entail a
request, but it does not.

5.15 Do you know whether you are able to close the door?

Hence, even forces that Sadock considers entailments of utterances -- and therefore totally
meaning-dependent -- seem to have some dependence on surface form.
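The over-prediction can be seen by composing the two observations mechanically. The Python sketch below is a deliberately naive, hypothetical rendering: the regular expressions cover only the sentence shapes used in the examples, and the function name is an assumption.

```python
import re

# Observation 1: "Do you know whether you are ...?" has the force of the embedded question.
KNOW_WHETHER = re.compile(r"^Do you know whether you are (.+)\?$")
# Observation 2: "Are you able to ...?" entails a request.
ABLE_TO = re.compile(r"^Are you able to (.+)\?$")


def predicted_request(utterance):
    """Return the request that the two observations, taken together, would predict."""
    match = KNOW_WHETHER.match(utterance)
    if match:
        utterance = "Are you " + match.group(1) + "?"
    match = ABLE_TO.match(utterance)
    if match:
        verb_phrase = match.group(1)
        return verb_phrase[0].upper() + verb_phrase[1:] + "."
    return None
```

Applied to 5.15, the function returns "Close the door.", which is exactly the prediction that fails: speakers do not take 5.15 to convey that request.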


5.3. Searle 
A third approach that is of interest to us here is that of Searle in [30]. I start with a
discussion of some generalizations that he proposes and then compare his approach to that
of Sadock.


Searle presents four generalizations for directives and five others for commissives.
These play a role in Searle's theory analogous to Gordon and Lakoff's single rule, but
Searle's generalizations differ from theirs in the following important ways:

1. Searle's generalizations are differentiated according to the parts of the speech act,
i.e., propositional content conditions, sincerity conditions, and preparatory conditions.
Gordon and Lakoff's sincerity conditions, in contrast, seem to be an amalgam of
Searle's sincerity and preparatory conditions.

2. Searle explains an asymmetry in his generalizations by identifying the
conversational participant who has the better knowledge of a condition, but he does
not state this explicitly in the generalizations. For Gordon and Lakoff, the question
of which participant has the better knowledge is at the core of their single rule.

Searle's approach seems to me to be a valuable one, in that the generalizations are
more finely differentiated than Gordon and Lakoff's rule. At the same time, I think that
Searle's generalizations can be questioned on the count that they are too specific. Recall
that the notions of speaker- and hearer-based conditions have not been given an explicit
place in the generalizations. Because of this, Searle is forced to state the generalizations in
terms of specific types of preparatory conditions, for example, rather than in terms of
preparatory conditions in general. Although Searle is correct in looking for finer
differentiation than that afforded by Gordon and Lakoff's rule, it appears that his
generalizations are needlessly complex. See Section 6 for an alternative proposal.

Searle's generalizations are presented as part of a larger framework. It is instructive
to compare this framework with that developed by Sadock, since there are some sharp
contrasts. Searle makes the distinction between meaning and use, saying that an utterance
such as 5.16 should be seen as meaning a question while, at the same time, it may be used as
a request.


5.16 Can you pass the salt?

5.17 Are you able to pass the salt? 

The fact that 5.16 takes please while 5.17 does not is accounted for by a maxim that one
should speak idiomatically in the normal case. Thus, Searle sees a form such as 5.16 as
"idiomatic but not an idiom." One attraction of this approach is that it accounts naturally (as
Searle points out) for the fact that responses to speech acts may correspond to surface form
as well as to a conveyed speech act.

There seem to be two major difficulties with Searle's approach. First, although he
acknowledges special grammatical behavior of some ISA forms, Searle is not willing to say
that such forms "mean" any speech act other than the one corresponding to the surface
form. Utterances such as example 5.5 ("Can you please..." forms) are treated as special
conventionalized forms, which is to say as very idiomatic (but still not idioms). If the please
in 5.3 is to be accounted for, this apparently must be done in terms of use, not meaning.
Thus, we have both use and meaning reflected in some surface forms, which blurs the
use/meaning distinction to some extent. To justify such a step, I think we need a more
developed account of the notion of "use" and a delineation of its impact on surface form.

Another difficulty with a strict meaning/use distinction appears in the variations
possible for a class of ISA forms that will be discussed in detail in Section 7.5. When 5.19 is
used in place of 5.18, I assume that Searle would call the statement that is conveyed the
meaning and the request that is conveyed the use. How, then, does 5.20 fit into this
framework?

5.18 Close the door. 


5.19 It's cold in here. 
5.20 Must I tell you that it’s cold in here? 




As I analyze it, example 5.20 can be used to convey three speech acts: first the "Must I tell
you...?" ASK, second the "It's cold in here" representative, and third the "Close the door"
REQUEST. Note that either response below would be acceptable:

5.21 Oh. I'll close the door.
5.22 All right. I'll close the door.
The following response would also be possible, although rude: 


5.23 Yes, you must. 


It is not clear how a strict meaning/use approach would account for the three levels of
representation involved here. Note that any change to the theory to accommodate "triple
message" ISAs will not be simple to make, due to restrictions on degree of indirection for
other ISA forms, as exemplified by 5.1 and 5.2 above.




It appears, then, that none of the ISA theories presented is totally adequate. All,
however, have extremely attractive aspects that are worth preserving. Gordon and Lakoff's
approach attempts to derive the ISA forms from general speech act conditions that have
independent uses. Sadock, while endorsing such general derivations, emphasizes the special
grammatical properties of some ISA forms. Although Searle's overall approach differs from
that of Gordon and Lakoff, his "generalizations" can be viewed as a more finely
differentiated version of Gordon and Lakoff's (too) simple principle for deriving ISAs. In
the approach described in the next sections I have preserved these positive aspects while,
hopefully, avoiding the negative ones.




6. Some General Rules for Indirect Speech Acts


One common property of the three theories discussed in the last section is the use of
general rules to account for ISAs. While the theories differ in the nature of individual rules
and the relative importance given to them, nevertheless general rules for ISAs are included
in each case. In light of the large number of ISA forms, such general rules appear to be
necessary. I follow Sadock, however, in questioning whether they are sufficient. In this
section I present a set of general rules for deriving ISAs from properties of speech acts.
These rules are intended as a basis only, and they will need to be both augmented and
restricted. Accordingly, in Section 7 I present another set of rules and categories that,
combined with the general rules from this section, give a more complete picture of the range
of ISAs possible. The reader is asked to keep in mind, then, that this section is intended as
only part of a theory, to be amended in Section 7.

The ISA rules presented in this section will be divided into three categories. The
categories are based on a common sense view of rational human behavior, expressed in
the following three maxims of action:


1. One should (only) initiate actions that are necessary.

2. One should (only) initiate actions for which some desirable result or results can be
expected.

3. One should only initiate actions that one expects to be possible.

These three maxims will be referred to as the maxims of Necessity, Desirability, and
Possibility.




Readers familiar with the classic work of Grice on conversational implicature [11] will
recognize the approach that is being taken. There, Grice suggests four categories of
maxims that are applicable to linguistic actions but which have analogs in other types of
actions. Here, I am stating maxims applicable to actions in general but which apply to
speech acts as a special case. The maxim of Necessity above has a counterpart in Grice's
category of Quantity. The other two maxims have no direct counterparts, and they suggest
extensions to Grice's framework.

We will use the maxims of Necessity, Desirability, and Possibility to provide a
conceptual organization for ISA rules. Accordingly, the next three subsections discuss rules
of each type. Section 6.4 looks at some general behavior with respect to mood and tense.
Finally, in Section 6.5 I discuss a fourth type of ISA that may call for a somewhat different
approach.


6.1. ISA Forms Related to the Maxim of Necessity
The maxim of Necessity says that one should act when necessary, but one should 
avoid extraneous actions. When this second clause is applied to speech acts, it yields the 


following rule: 


Rule 6.1

P1 can convey a speech act indirectly by --

(i) ASKing whether the intended speech act is necessary
e.g., the REQUEST "Do I need to tell you to shut the door?"

(ii) ASKing whether an equivalent speech act (i.e., one with the same principal
intended effect) has already been performed
e.g., the REQUEST "Did anyone ask you to take out the garbage?"

(iii) ASKing whether the principal intended effect can be expected to occur without
the speech act
e.g., the REQUESTs:
"Are you planning to take out the garbage?"
"Are you going to take out the garbage?"

(iv) ASKing whether the principal intended effect of the speech act has already
occurred
e.g., the REQUESTs:
"Did you take out the garbage?"
"Have you taken out the garbage?"

For Rule 6.1 and other rules presented in this paper, I am assuming a relatively constrained
relationship between the actual representations of the rules and the semantic representations
of individual utterances. See Appendix B for a description of this relationship. Although
REQUEST examples were used in 6.1 (and will continue to be used in this section), the
general ISA rules are intended to apply to any speech act where P2 can be expected to have
the appropriate knowledge to respond to P1's ISA.
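
The four clauses of Rule 6.1 can be pictured as a mapping from an intended speech act to four ASK topics. The following is only a minimal sketch of that reading, not the representation used in this paper (see Appendix B for that): the `Ask` class, the tuple encoding of topics, and labels such as `NECESSARY` are all hypothetical names introduced for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Ask:
    """An ASK whose topic is a yes/no proposition (hypothetical encoding)."""
    topic: tuple


def rule_6_1(speech_act: str, effect: str) -> dict:
    """Return the ASK topic licensed by each clause of Rule 6.1.

    speech_act -- name of the intended speech act, e.g. "REQUEST"
    effect     -- its principal intended effect, kept as an opaque string
    """
    return {
        # (i) is the intended speech act necessary?
        "i": Ask(("NECESSARY", speech_act, effect)),
        # (ii) has an equivalent speech act already been performed?
        "ii": Ask(("ALREADY-PERFORMED", "EQUIVALENT-ACT", effect)),
        # (iii) can the effect be expected to occur without the speech act?
        "iii": Ask(("EXPECTED-WITHOUT-ACT", effect)),
        # (iv) has the effect already occurred?
        "iv": Ask(("ALREADY-OCCURRED", effect)),
    }


forms = rule_6_1("REQUEST", "P2 takes out the garbage")
```

Under this sketch, `forms["iv"]` corresponds to the topic behind "Did you take out the garbage?"; mapping topics to surface strings is deliberately left out.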


In Rule 6.1, note that Clause (iv) is also expected to account for forms such as 6.1 used
as a REQUEST to take out the garbage:

6.1 Is the garbage out?

This will require additional rules and distinctions, however, which will be introduced in
Section 7.5.




Earlier versions of this paper accounted for the "Will you <action>?" REQUEST form
by an appeal to Clause (iii) of Rule 6.1. I have not been satisfied with this account,
however, because some uses of the form are not motivated by questions of the necessity of
the action. Consider, for example, 6.2.


6.2 Will you accept a ride to the airport?


This example can be accounted for by the maxim of Possibility. One can view 6.2 as P1
asking P2 whether the outcome of an OFFER by P1 will be successful (i.e., acceptance).
While some readings of the "will" form may be accounted for by Rule 6.1, we need a broader
view of this form. See Section 6.3.

Finally, note that P1 is permitted to use an ISA only when P1 can reasonably expect
P2 to decipher P1's intent, i.e., to recognize the indirection. Neither Rule 6.1 nor any of the
other rules presented here, however, includes this information. It appears that this
constraint is part of a more general constraint that P1 avoid ambiguity. That is, P1 is
obligated, to the best of his or her ability, to frame any utterance (ISA or not) in such a
way that P2 can understand the message that P1 intended to convey. See Grice [11] for
discussion of an "avoid ambiguity" maxim.


6.2. ISA Forms Related to the Maxim of Desirability 
Next we come to the maxim of Desirability, restated here:

One should (only) initiate actions for which some desirable result or results can be
expected.

Related to this maxim, we have the following general ISA rule:


Rule 6.2:

P1 can convey a speech act indirectly by --

-- a representative with the propositional content that some desirable result or results
can be expected for some intended effect of the speech act.

e.g., the REQUEST "I would be happier if you'd substantiate those figures." Here,
the desirable result is the happiness of P1, and the intended effect of the REQUEST
is that P2 substantiate the figures.

Again, for Rule 6.2, although the example is a REQUEST, the rule is expected to apply to 
speech acts in general. 

In Rule 6.2, note that the intended effect need not be an immediate RESULT or 
PRINCIPAL-RESULT (see Section 4) of the speech act; it may be several times removed in 
the causal chain. Similarly, the desirable result need not be an immediate RESULT or 
PRINCIPAL-RESULT of the intended effect. Thus, we have the following examples of 
indirect representatives.


6.3 You'll be happy to hear that Claudia won.
(Hear refers to a RESULT of the speech act.)


6.4 You'll be happy to learn that Claudia won.
(Learn refers to the principal intended effect of the speech act.)


6.5 You'll be happy to know that Claudia won.
(Know refers to the PRINCIPAL-RESULT of the principal
intended effect of the speech act.)

Note that the desirable results from the maxim need not relate only to P1 and P2.
Consider the following REQUEST example, taken from [14]:


6.6 This room would look a lot better if you dusted it. 




Example 6.6 contains information about an inanimate object with the claim that it would be
more attractive (to anyone) if the REQUESTed action were performed. Neither P1 nor P2 is
explicitly mentioned in the statement of the result.


6.3. ISA Forms Related to the Maxim of Possibility

The third maxim proposed was the maxim of Possibility: one should only initiate
actions that one expects to be possible. The minimal conditions for this maxim to obtain
can be stated in terms of the framework developed in Section 4. A speech act can be
considered to be possible if:

1. the preconditions of its associated method can be satisfied. 


2. for an action-centered speech act, the semantic input cases of the action named can 
be filled. 


3. for an action-centered speech act, the preconditions of the action named can be
satisfied.

The maxim of Possibility has a rich set of corresponding ISA rules. Rules related to (1) and
(2) above are explored in the next two subsections. Rules related to (3) above are discussed
in Section 7.5.2.


6.3.1. Forms Based on Preconditions of Individual Speech Acts 
The first set of ISA forms based on the maxim of Possibility are those derived from
preconditions of speech acts. Note that the preconditions associated with the speech acts
give conditions particular to the speech act. There are other general conditions of speech acts
which must obtain for the individual act to be well-formed, as evidenced by Rules 6.1 and
6.2. I am not including these when I refer to preconditions. The approach taken in this
subsection will be to distinguish three classes of precondition and formulate seven rules
using the classes distinguished.

While the rules suggested are intended to apply generally to speech acts, once again
REQUEST will be used to supply examples. (Examples for OFFER, SUGGEST, and ASK
are given in Appendix C.) To start, then, recall the preconditions identified in Section 3 for
REQUESTs:


I. P1 wants P2 to take responsibility for carrying out the action.
II. P1 believes that P2 can take responsibility for carrying out the action.
III. P1 believes that P2 is willing to take responsibility for carrying out the action.
IV. P1 believes that P2 is obligated to P1 (and possibly to others) to take responsibility
for carrying out the action. (This obligation may be, e.g., an authority
obligation or a general obligation to be helpful.)

The preconditions of REQUEST, and other speech act preconditions, can be divided
according to which dialogue participant has the better knowledge of the conditions specified.


1. P1-based preconditions.

Here P1 has inherently better knowledge of whether or not the topic of the
precondition holds. The topic of preconditions that begin here with "P1 believes" is
considered to be the direct object of the initial "believe." For other preconditions (e.g.,
REQUEST I), the topic is the entire pattern. Preconditions that are P1-based include
those whose topic is a goal of P1 (e.g., REQUEST I).


2. P2-based preconditions.

Here P2 has inherently better knowledge of whether or not the topic of the
precondition holds. Preconditions that fit this category include P1's beliefs about P2's
intentional states. An example of a P2-based precondition is REQUEST (III).

3. Unmarked preconditions.

For these preconditions, determination about which participant has the better
knowledge of the precondition depends on properties of the individual speech act
and/or its context. Examples are REQUEST (II) and (IV).

Using these precondition types, we can construct the following seven rules for ISA
forms.

P1 can convey a speech act indirectly by --


Rule 6.3 (for P1-based preconditions):
-- a representative of the topic of a P1-based precondition of the speech act.31

e.g., I want you to water the plants. (REQUEST I.)
I hope you will use common sense. (REQUEST I.)


Rule 6.4 (for P2-based preconditions):
-- an ASK of the topic of a P2-based precondition of the speech act.

e.g., Do you want to shut the door? (REQUEST III.)


Rule 6.5 (for unmarked preconditions):
-- an ASK of the topic of an unmarked precondition of the speech act.

This rule applies in a context where P1 believes P2 has better knowledge of the
condition in the precondition topic.

e.g., Is it your turn to do the dishes? (REQUEST IV.)


31. It is probably the case that for all P1-based preconditions, the topic is equivalent to
the entire precondition. This rule could probably be stated simply as "a
representative of a P1-based precondition," but for now I stay with the more general
version.


Rule 6.6 (for unmarked preconditions):
-- a representative of the topic of an unmarked precondition of the speech act.

This rule applies in a context where P1 believes P1 has better knowledge of the
condition in the precondition topic.

e.g., It's your turn to do the dishes. (REQUEST IV.)


Rule 6.7 (for unmarked preconditions):32
-- a representative of an unmarked precondition of the speech act where P1
believes he or she has better knowledge of the condition.

e.g., I believe it's your turn to do the dishes. (REQUEST IV.)
I assume you are able to shut the door. (REQUEST II.)


Rule 6.8 (for groups of preconditions):
-- a REQUEST form of an action that is a goal of P1 (i.e., A for "P1 wants A").

This rule is applicable only when the speech act has preconditions that are
exact matches or specializations of the four preconditions of REQUEST.

e.g., Take a cookie. (OFFER IV-VII.)


32. Utterances derived from this rule could also be treated as double indirections, e.g., an
indirect form of a representative of the speech act. If this is done, however, these
utterances would be special cases, since double indirections are not possible in general.
(Recall example 5.1, "Tell me if you can move over," which is not a REQUEST that P2
move.) Whatever the treatment of these utterances, note that the initial "I believe," "I
assume," etc. functions more as a mitigator than as a separate speech act. For a discussion
of mitigators, see [14].


Rule 6.9
-- an ASK about whether P2 will take responsibility for carrying out an
"active" action that is a goal of P1 (i.e., A for "P1 wants A").33

This rule is applicable only when the speech act has preconditions that are
exact matches or specializations of the four preconditions of REQUEST.

e.g., Will you accept a ride to the airport? (OFFER IV-VII.)


Although these seven rules are more complex than Gordon and Lakoff's single rule,
they give us a more powerful basis for defining the set of precondition-based ISA forms.
First, nothing corresponding to Rules 6.8 and 6.9 (which operate on groups of
preconditions) is found in Gordon and Lakoff's approach. Second, the treatment of the
precondition classes is more powerful here. Nothing in Gordon and Lakoff's dichotomy
between speaker- and hearer-based sincerity conditions predicts conditions that can either
be questioned or asserted, depending on context.
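
The way Rules 6.3 to 6.7 select between asserting and questioning can be sketched as a small dispatch on the class of the precondition. This is an illustrative sketch only: the `Precondition` record, the `basis` labels, and the `better_knower` argument (standing for P1's belief about who knows the condition better in context) are assumptions made for the example, and Rules 6.8 and 6.9, which operate on groups of preconditions, are not covered.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Precondition:
    label: str   # e.g. "REQUEST IV"
    basis: str   # "P1", "P2", or "UNMARKED"
    topic: str   # what follows "P1 believes", or the entire pattern
    full: str    # the entire precondition


def licensed_forms(pre, better_knower=None):
    """Return (rule, speech act type, content) triples licensed for `pre`."""
    if pre.basis == "P1":
        return [("6.3", "REPRESENTATIVE", pre.topic)]
    if pre.basis == "P2":
        return [("6.4", "ASK", pre.topic)]
    if pre.basis == "UNMARKED":
        if better_knower == "P2":
            return [("6.5", "ASK", pre.topic)]
        if better_knower == "P1":
            # Rule 6.6 asserts the topic; Rule 6.7 asserts the whole precondition.
            return [("6.6", "REPRESENTATIVE", pre.topic),
                    ("6.7", "REPRESENTATIVE", pre.full)]
        return []  # context does not settle the matter; the sketch licenses nothing
    raise ValueError(f"unknown basis: {pre.basis}")


request_iv = Precondition(
    label="REQUEST IV",
    basis="UNMARKED",
    topic="P2 is obligated to take responsibility for the action",
    full="P1 believes that P2 is obligated to take responsibility for the action",
)
asked = licensed_forms(request_iv, better_knower="P2")
asserted = licensed_forms(request_iv, better_knower="P1")
```

Here `asked` corresponds to "Is it your turn to do the dishes?" and `asserted` to "It's your turn to do the dishes." and "I believe it's your turn to do the dishes.", the one precondition yielding both a question and assertions depending on context.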

The major difference between Rules 6.3 to 6.9 and Searle's generalizations rests in the
level of generality of the two schemes. Recall that Searle's generalizations are more
numerous because they are associated with different classes of speech acts. In contrast, the
rules given here apply to speech acts in general, and they are proposed as the full set of
general ISA rules that apply to preconditions.

In terms of specific rules, Rules 6.8 and 6.9 differ most from Searle's generalizations,
because, again, they operate on groups of preconditions. Rule 6.9 is of special interest. In
Searle's scheme, the very common "Will you <action>?" form is derived from the
propositional content condition of directives. I feel that this approach lacks semantic
motivation, and so the approach taken here is to appeal to the maxim of Possibility. One
can interpret a "will" question as a question about how P2 will respond to P1's speech act.
The "will" ISA form can be seen as a hedge in case P2 cannot respond appropriately, does
not wish to do so, or recognizes no obligation to do so. By using this form, P1 can carry out
the intended speech act even though he or she may have some reservations about its effects.
Anchoring "will" forms in groups of preconditions and appealing to the maxim of
Possibility provides these forms with a stronger semantic motivation than the more
structural account offered by basing rules on propositional content conditions.


33. By "active" I mean an action that is viewed as an event in the environment
shared by P1 and P2. For further discussion, see Section 6.4.


Given the constraints on matching described in Appendix B, Rules 6.3 through 6.9
are relatively restrictive. I will conclude this subsection with some observations on the
forms that the rules do not account for. First, the rules as written do not account for
differences in tense and mood. That is, Rule 6.5 generates example 6.7 but not 6.8 and 6.9.

6.7 Are you able to drive Sarah to school?
6.8 Will you be able to drive Sarah to school?
6.9 Would you be able to drive Sarah to school?


Since 6.8 and 6.9 are legitimate indirect REQUESTs, they must be accounted for; Section
6.4 contains proposals for handling this tense and mood behavior.

Another collection of forms that is not covered by Rules 6.3 to 6.9 are those that
contain negations of the sort found in patterns 6.10 to 6.13:

6.10 Won't you <action>?
6.11 Couldn't you <action>?
6.12 Shouldn't you <action>?
6.13 Can't you <action>?




Application of Rules 6.3 to 6.9 to the REQUEST preconditions does not yield such negated
forms. In this case, I think that the exclusion is well-founded. Forms 6.10 to 6.13 can be
interpreted as conveying several different speech acts, but two of the more prominent ones
are begging and persuading. What these two speech acts have in common is P1's need to
overcome some sort of resistance in P2. (The resistance is to the action named in the speech
acts; both are action-centered.) REQUEST does not involve overcoming any special
resistance; precondition (III) relates to P1's estimate of P2's openness to the action. Thus,
while forms 6.10 to 6.13 are clearly similar to REQUEST forms, they are not treated here as
REQUESTs.


6.3.2. ISA Forms Based on Semantic Cases of Actions

We continue with the maxim of Possibility and observe that ISA forms for some
types of action-centered speech act are based on semantic input cases of the action named.
Consider, for example:


6.14 Do you have a rope to throw to that man?
(OBJECT of the throwing action)


6.15 Have you got a hammer to fix that picture?
(INSTRUMENT of the fixing action)


6.16 Do you have enough gas in your car to drive George to the airport?
(SPECIFIC-RAW-MATERIAL of the driving action)

All of these examples can (among other things) act as indirect REQUESTs for P2 to do the
action named. Given the appropriate context, they can also act as indirect SUGGESTs.
(One such context would be the case where P2 is trying to decide what to do in a situation.)


The basic pattern that these examples fit is given in the following rule:34


Rule 6.10

P1 can convey a REQUEST or SUGGEST indirectly by --

-- ASKing whether P2 has <item that fits a semantic case slot> for/to <action>

Here, the semantic case is one defined for the action named. It is not clear whether this
formulation is too broad; a pattern restricted to a subset of the semantic cases may be
more accurate. For instance, a partner in 6.17 is a filler for the CO-AGENT semantic case.
Example 6.17 uttered by a tournament official can definitely be a REQUEST that P2 get a
partner, but can it also be used as a REQUEST that P2 play the next match (as Rule 6.10
would have it)?

6.17 Have you got a partner to play the next match?


While example 6.17 suggests that Rule 6.10 is too broad, other examples suggest that it
is too restrictive. In addition to examples that correspond quite well to the semantic input
cases identified in Section 4, there are other examples that are clearly related but not as easy
to account for. Consider:

6.18 Do you have time to take Sarah to the sian?

6.19 Do you have room to store my plants?

Examples 6.18 and 6.19 are time- and space-related, respectively. When representing actions
with OWL-I methods, it is assumed that all actions occur at some time, but, because of the
generality of this attribute, time is not represented as a semantic input case. (This does not
preclude the use of a different representation to associate times with particular events.) Even
if the starting time of an action were represented as a semantic input case, this would not be
an immediate solution to the problem of accounting for example 6.18. This is because the
time discussed there is elapsed time, not a single point in time. Similarly, example 6.19 can
be related to the semantic case SPECIFIC-LOCATION, but it seems to be a question about
the size of a place, rather than simply a place. While we could consider extending Rule 6.10
to include attributes of semantic cases, we run the risk of admitting other forms that are not
ISAs. Pending a solution to the difficulties posed by examples 6.17 to 6.19, I leave Rule 6.10
as stated. This is done, however, with the caveat that it merits further investigation.


34. It is possible that this rule can be generalized, since analogous, but not always identical,
forms exist for other action-centered speech acts.


To sum up this subsection, we can say that for at least some action-centered speech
acts, ISA forms are based on semantic input cases of the action named. Work remains to be
done to establish the exact subclass of action-centered speech acts that permit this type of
ISA form.
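
The pattern in Rule 6.10 can be illustrated with a toy generator that produces one ASK per filled semantic input case. This is a sketch under simplifying assumptions: the function name `rule_6_10`, the flat dictionary of case fillers, and the fixed "Do you have ... to ...?" template are all hypothetical, and only the "to <action>" variant of the pattern is produced.

```python
def rule_6_10(action: str, case_fillers: dict) -> dict:
    """Map each semantic input case of `action` to a candidate ASK surface form.

    action       -- infinitive phrase naming the action, e.g. "throw to that man"
    case_fillers -- semantic case name -> noun phrase filling that case
    """
    return {
        case: f"Do you have {filler} to {action}?"
        for case, filler in case_fillers.items()
    }


asks = rule_6_10("throw to that man", {"OBJECT": "a rope"})
# The same template also yields a CO-AGENT form, which is exactly the kind of
# output whose status as a REQUEST to play the match is in doubt (cf. 6.17).
doubtful = rule_6_10("play the next match", {"CO-AGENT": "a partner"})
```

Because the generator treats every semantic case alike, it reproduces both the useful forms (6.14) and the possibly overbroad ones; restricting it to a subset of cases would be a one-line filter on `case_fillers`.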


6.4. Tense and Mood Variations 

This completes the presentation of ISA rules associated with the three maxims of
action. In this subsection I briefly discuss the derivation of future tense ISAs, then go into
somewhat more detail for subjunctives.


The following rule expansion can be applied to all ISA rules: 


Expansion for future tense:

If an action-centered speech act refers to an action that is to take place sometime
other than "right away," then the ISA form may be stated in the future tense, if
this is appropriate.

Although the future expansion and the subjunctive expansion below are rules, note that 
they are not at the same level as the other ISA rules presented so far. The expansions are 
rules that act on other rules. Alternatively, it would be possible (but wordy) to write 
individual ISA rules with separate clauses to account for tense and mood behavior. 
In the future expansion, the “if this is appropriate” hedge is used to account for 
differences such as those between the REQUEST forms 6.20 through 6.23. 
6.20 Will you be able to <action>?
6.21 Can you <action>?
6.22 Will you be willing to <action>?
6.23 ?Will you want to <action>?

While 6.20 and 6.22 occur in the future tense, there is no future for can and 6.23 is
questionable as an indirect REQUEST. Some properties, apparently, are viewed as more
time-variant than others. It is this difference that the "if appropriate" hedge is meant to allow
for.


Turning to the subjunctive, we can frame the following rule expansion: 


Expansion for subjunctive:

ISA rules that generate ASK forms where the state or action that will be named
by the finite verb is not set in the past may also be used to generate subjunctive
forms if the conveyed speech act has a principal intended effect that involves
active response on the part of P2.


By "active" I mean a response that is viewed as an event in the environment shared by P1
and P2. This constraint is used to include utterances such as the indirect ASK 6.24 while
excluding those such as the indirect representative 6.25.

6.24 Would you know if I can get a boarding pass at that desk?

6.25 *Would you know that the plane left already?

I am assuming that both of these utterances are related to "Do you know..." forms. Here, the
principal intended effect of 6.24 is the "active" response of supplying an answer, while the
principal intended effect of 6.25 is the more passive response of understanding the
propositional content.

The subjunctive expansion above is written to exclude past actions. This is relevant
for forms derived from Rule 6.1. For example, although 6.26 is a common type of
indirect REQUEST, 6.27 is not an indirect REQUEST at all.

6.26 Have you dusted? 

6.27 Would you have dusted? 

The exclusion of past actions in the subjunctive expansion effectively bars the interpretation 
of 6.27 as a REQUEST to dust. 
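
Since the expansions are rules that act on other rules, one way to picture the subjunctive expansion is as a wrapper around a rule. The sketch below is illustrative only: `AskForm`, its boolean fields, and `toy_rule` are hypothetical stand-ins for whatever a real ISA rule would produce, and no surface morphology is generated; the wrapper merely marks which forms may also appear in the subjunctive.

```python
from dataclasses import dataclass, replace


@dataclass(frozen=True)
class AskForm:
    gloss: str              # rough content of the ASK
    set_in_past: bool       # is the finite verb's state/action set in the past?
    active_response: bool   # does the principal intended effect need an active response by P2?
    mood: str = "indicative"


def subjunctive_expansion(rule):
    """Wrap an ISA rule so that eligible ASK forms also get a subjunctive variant."""
    def expanded(*args, **kwargs):
        forms = list(rule(*args, **kwargs))
        extra = [
            replace(f, mood="subjunctive")
            for f in forms
            if not f.set_in_past and f.active_response
        ]
        return forms + extra
    return expanded


def toy_rule():
    """Hypothetical rule output modelled on examples 6.24 to 6.27."""
    return [
        AskForm("know if I can get a boarding pass", False, True),   # cf. 6.24
        AskForm("know that the plane left already", False, False),   # cf. 6.25
        AskForm("have dusted", True, True),                          # cf. 6.26, 6.27
    ]


expanded_forms = subjunctive_expansion(toy_rule)()
```

Only the first form acquires a subjunctive variant: the second fails the active-response test and the third is set in the past, mirroring the exclusion of 6.25 and 6.27.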

An important question that is relevant here is what event or condition is predicated
by speech act-related subjunctives. This question has received some attention in the
literature. One common answer to this question is that the implicit event is "if P1
performed the speech act", e.g., "if I asked you" for a REQUEST. Searle points out in [30],
however, that while 6.28 may convey a request, 6.29 may not.


6.28 Would you pass me the salt? 
6.29 Would you pass me the salt if I asked you to?




Searle suggests instead that the subjunctive is predicated on "if you please" or some variant.
The notion of active response used in the subjunctive expansion above complements this
interpretation. "If you please" is a way for P1 to ask P2 to look favorably on an action, and
that action is the active response associated with the successful completion of the speech act.


6.5. Examples that Fall Outside the Rules 

The final topic for this section is a group of ISAs that are not completely accounted 
for by the approach that has been taken. Consider, for example: 

6.30 Is your leg healed enough for you to go to the store for me? 

6.31 Will you be home in time to walk the dog? 

Looking back at the REQUEST preconditions, these examples seem to correspond to (II),
related to P2's capability, but the correspondence cannot be accounted for by the rather
restricted matching relationship I am assuming (see Appendix B).

In trying to account for examples such as these, my first inclination was to appeal to
the definition of can in Precondition (II) and expand the matching rules to include matches
on parts of definitions. That is to say, the semantic representation for can would be
assumed to have a definition that included notions of physical ability, having free time, and
being within an appropriate spatial range to do an action. It turns out, however, that this
treatment puts a great deal of strain on the precondition-based approach without really
solving the problem exemplified by 6.30 and 6.31. In terms of matching, there is still a good
distance between a healed leg and the more general notion of physical ability, or between
the relationship of being at home with a dog and the idea of being in the right range to
perform the action of taking it for a walk. There are definite correspondences here, of
course, but the point is that establishing the correspondences can be a complex
computational problem. This is in contrast to the approach I have been taking, where the
matching process has been carefully constrained.

Although the approach proposed here appears to be too restricted to handle some
cases, the answer does not necessarily lie in a wholesale replacement of Rules 6.1 to 6.10 and
Appendix B with a more general formulation. Instead, I advocate a layered approach,
using several sets of rules of varying power. We will have made some progress if we can
isolate the classes of ISAs that can be derived from properties of speech acts using highly
restricted rules. These forms constitute a class of ISAs, and their restricted
nature can be exploited in a computational model. Other cases can then be handled by a
more general, and presumably more expensive, inferential mechanism. An example of a
general inferential approach is found in Allen [1].
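
The layered approach can be sketched as a recognizer that consults cheap, restricted rules first and falls back to a general mechanism only when none applies. Everything named below is a hypothetical placeholder: `can_you_rule` is a crude string pattern standing in for a properly constrained ISA rule, and `toy_inference` stands in for a general inferential component, which in practice would be far more elaborate.

```python
from typing import Callable, Iterable, Optional, Tuple

Rule = Callable[[str], Optional[str]]


def recognize(
    utterance: str,
    restricted_rules: Iterable[Rule],
    general_inference: Rule,
) -> Tuple[Optional[str], str]:
    """Return (reading, layer), trying the restricted layer before the general one."""
    for rule in restricted_rules:
        reading = rule(utterance)
        if reading is not None:
            return reading, "restricted"
    return general_inference(utterance), "general"


def can_you_rule(utterance: str) -> Optional[str]:
    """Toy restricted rule: 'Can you <action>?' read as a REQUEST for <action>."""
    prefix = "Can you "
    if utterance.startswith(prefix) and utterance.endswith("?"):
        return "REQUEST: " + utterance[len(prefix):-1]
    return None


def toy_inference(utterance: str) -> Optional[str]:
    """Toy stand-in for an expensive general inferential mechanism."""
    return "REQUEST (inferred)" if "for me" in utterance else None


fast = recognize("Can you pass the salt?", [can_you_rule], toy_inference)
slow = recognize(
    "Is your leg healed enough for you to go to the store for me?",
    [can_you_rule],
    toy_inference,
)
```

The design point is only the ordering: forms such as 6.30, which the restricted matching cannot reach, are passed on to the second layer rather than forcing the first layer to become more general.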




7. A Taxonomy of Indirect Speech Act Forms


7.1. Overview

In this section I present a taxonomy of speech act forms, with emphasis on indirect
speech acts. The sets of categories in the taxonomy are to be viewed as different
dimensions, so that utterances can, and typically do, belong to more than one category at
once. I will use REQUEST as a source of examples. The categorization scheme does apply
to other speech acts, often, however, with some simplification necessary.

The first distinction that will be discussed is whether the form is direct, indirect, or
tag. Indirect forms, of course, are of primary interest for this paper. It is also, however,
necessary to consider direct speech act forms for the same reason that it was necessary to
clarify speech act organization in Section 2: one speech act's direct form is another speech
act's indirect form. Besides direct and indirect forms, I distinguish tag forms. Tag forms
are sometimes considered indirect forms, but I will argue for their treatment as a separate
class.

The next distinction that will be considered is whether an utterance is single-force,
double-force, or triple-force. Corresponding to this is the notion of whether a given speech
act is conveyed as the immediate, secondary, or tertiary force of the utterance. This second
distinction corresponds roughly to Sadock's meaning/entailment distinction, but it is recast
to avoid the problems described in Section 5. Related to the distinction among forces is the
question of the existence of speech act idioms. In Section 7.4 I take a position on this issue
with the introduction of frozen ISA forms.


The final two distinctions, which apply only to what I have called action-centered
speech acts, have received only casual attention in the literature. The first distinction
hinges on whether P1 names the action involved explicitly or refers to it only implicitly.
The two ISA categories are, accordingly, explicit-action and implicit-action. For
implicit-action ISA forms, there is the further categorization into three types, which for simplicity
will be referred to with numbers. The difference between type 1, 2 and 3 implicit-action
forms lies in the type of information needed to recognize the action intended by P1.


We can now proceed to a more detailed discussion of each set of categories.


7.2. Direct, Indirect, and Tag Forms

Figure 7.1 illustrates the first distinction for REQUEST forms. In this figure, slashes
are used to indicate choices, parentheses indicate optional words or phrases, and angle
brackets contain descriptions of words or phrases.

Direct forms come in two varieties: performative and simple. The former is Austin's
class of utterances in which the speech act is stated explicitly. Simple forms are special
grammatical or idiomatic structures that are associated with the speech act. The simple
form for a REQUEST is the imperative. Note that the relationship between simple forms
and speech acts is not one-to-one. The simple form for giving instructions is, like that for
REQUESTs, an imperative, although possibly modified by an infinitive phrase (e.g., "To
open, push in and twist."). Warnings also use the imperative as a simple form. Not only do
some speech acts share simple forms, but other speech acts have more than one, e.g.,
dialogue openers with the simple forms "Hello," "Hi," etc. Finally, many speech acts,
particularly ceremonial ones, lack simple forms altogether and rely on the performative for a


DIRECT FORMS 


PERFORMATIVE 

I request that <action>. 

ex. I request that you write a program to manipulate blocks for me.
SIMPLE FORM 

<Imperative of action requested> 

ex. Write a program to manipulate blocks for me. 


INDIRECT FORMS 


(1) Will <action>? 
ex. Will you write a program to manipulate blocks for me?


(2) Would <action>? 
ex. Would you write a program to manipulate blocks for me? 


(3) Can <action>? 
ex. Can you write a program to manipulate blocks for me? 


(4) Could <action>? 
ex. Could you write a program to manipulate blocks for me? 


(5) I want <action infinitive>.
ex. I want you to write a program to manipulate blocks for me. 


(6) I would like <action infinitive>. 
I'd like <action infinitive>. 
ex. I would like you to write a program to manipulate blocks for me. 


TAG FORMS 


(1) <imperative of action requested>, will you? 
ex. Write me a program to manipulate blocks, will you? 


(2) <imperative of action requested>, would you? 
ex. Write me a program to manipulate blocks, would you? 


(3) <imperative of action requested>, can you? 
ex. Write me a program to manipulate blocks, can you? 


(4) <imperative of action requested>, could you? 
ex. Write me a program to manipulate blocks, could you? 


Figure 7.1. Some examples of explicit-action forms for REQUEST.


direct expression of the speech act. 

The next class is indirect forms, in which the speech act is conveyed by a simple form
for another speech act. Figure 7.1 lists a small subset of the possible indirect forms. The
other sets of categories presented in this section will give us a way to look further at the
nature of indirect forms. Note that Figure 7.1 does not include negations of these
forms, e.g., those involving can't, won't, couldn't, etc. Recall the decision discussed in
Section 6.3.1 to treat these forms as ISAs for related speech acts such as
persuading (among other possibilities).

Finally, tag forms are constructed from a simple form combined with the identifying
part of an indirect form. The identifying part for most indirect forms is the finite verb plus
subject (e.g., "Move it over, would you?"), but in some cases it includes more (e.g.,
"What's the answer, I'd like to know."). The ISA forms that may have corresponding tag
forms can be characterized more specifically as frozen forms, a class which is defined in the
next subsection. For REQUESTs, the tag forms correspond to the interrogative frozen ISA
forms. This is not, however, necessarily the case for other speech acts. Since I can give no
more precise specification of the ISA forms that have corresponding tag forms, this is left as
a problem for further research.

A separate category has been allotted for tags, since they may function as an amalgam
of two forms. (For example, in many contexts "Move it over, would you?" has the brusqueness
of the simple form, which is then softened by the
more humble "would you.") A view which treats tag forms merely as transformations of
indirect forms misses the dual nature of the force that they can convey.
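
The construction of tag forms described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function name make_tag_form, its interrogative flag, and the list REQUEST_TAGS are hypothetical names introduced here, and the sketch treats the identifying part as a ready-made string rather than deriving it from an indirect form.

```python
def make_tag_form(simple_form: str, identifying_part: str, interrogative: bool = True) -> str:
    """Combine a simple form with the identifying part of an indirect form."""
    body = simple_form.strip().rstrip(".!?")
    tag = identifying_part.strip().rstrip(".!?")
    # The tag keeps the closing punctuation of the indirect form it came from.
    closing = "?" if interrogative else "."
    return f"{body}, {tag}{closing}"


# Identifying parts (finite verb plus subject) of the interrogative forms in Figure 7.1.
REQUEST_TAGS = ["will you", "would you", "can you", "could you"]

if __name__ == "__main__":
    for tag in REQUEST_TAGS:
        print(make_tag_form("Write me a program to manipulate blocks.", tag))
    # An identifying part that includes more than verb plus subject.
    print(make_tag_form("What's the answer?", "I'd like to know", interrogative=False))
```

The sketch says nothing about which indirect forms admit tags; as noted above, that question is left open.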




7.3. The Speech Acts Conveyed 

The next set of categories gives a way to talk about the speech act or acts conveyed
by an utterance. My first assumption is that some, but not all, utterances convey more than
one speech act. These different speech acts will be referred to as the forces of an utterance.
For the utterances that convey multiple speech acts, the largest number of such speech acts
that I have found in English is three. Utterances are therefore referred to as either single-force,
double-force, or triple-force, and the three forces will be called immediate, secondary,
and tertiary.35 The speech acts are called only forces, since I wish to avoid appealing to
any notion of "meaning." Although if pressed I would have to say that I see all speech act
forces conveyed by the utterance as part of the meaning of the utterance, semantic
investigations are at too early a stage, I think, for a concrete definition of meaning.36

Examples of utterances that convey different numbers of forces are easy to find. First,
utterances with only an immediate force include direct, simple forms, e.g., 5.18 (repeated
below) uttered with an immediate REQUEST force. An utterance with both immediate and
secondary force might be 5.4 uttered with an immediate ASK force corresponding to the
surface form and a secondary REQUEST force. Finally, there is example 5.20 which may
be uttered with the intent to convey an immediate ASK, a secondary representative, and a
tertiary REQUEST force.

5.18 Close the door. 


5.4 Are you able to close the door?

5.20 Must I tell you that it's cold in here?


35. My use of the term secondary differs from its use in Searle [30]. I hope that this will not
be a source of confusion.


36. See, however, Zwicky and Sadock [36] and Sadock [25] for important work in this area. 




To account for the different forces in these examples, let us hypothesize a mechanism
that applies the rules given in Section 6. (I postpone to Section 7.5 the issue of how to link
the statement of the condition in 5.20 with the particular action intended.) With this
hypothesized mechanism, immediate forces of ISAs are derivable directly from the surface
form (or a semantic representation of it), secondary forces are derived by one application of
the rules, and tertiary forces are derived by two applications of the rules.
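
A minimal Python sketch of this hypothesized mechanism is given here, before its difficulties are taken up. Everything in it is an assumption made for illustration: forces are plain strings, a rule is any function from one force to a further force (or None), and TOY_TABLE is a hypothetical lookup table standing in for the Section 6 rules, which are not reproduced.

```python
from typing import Callable, List, Optional

Force = str
# A rule maps one force to the further force it can convey, or None if it does not apply.
Rule = Callable[[Force], Optional[Force]]

FORCE_NAMES = ["immediate", "secondary", "tertiary"]


def derive_forces(immediate: Force, rules: List[Rule]) -> List[Force]:
    """Return the forces of an utterance, starting from its immediate force."""
    forces = [immediate]
    # At most two rule applications: one for secondary force, one for tertiary force.
    while len(forces) < len(FORCE_NAMES):
        results = (rule(forces[-1]) for rule in rules)
        derived = next((f for f in results if f is not None), None)
        if derived is None:
            break
        forces.append(derived)
    return forces


# Hypothetical stand-in for the Section 6 rules, covering examples 5.4 and 5.20 only.
TOY_TABLE = {
    "ASK: are you able to close the door": "REQUEST: close the door",
    "ASK: must I tell you that it's cold in here": "REPRESENTATIVE: it's cold in here",
    "REPRESENTATIVE: it's cold in here": "REQUEST: close the door",
}


def toy_rule(force: Force) -> Optional[Force]:
    return TOY_TABLE.get(force)


if __name__ == "__main__":
    for start in ["REQUEST: close the door",
                  "ASK: are you able to close the door",
                  "ASK: must I tell you that it's cold in here"]:
        for name, force in zip(FORCE_NAMES, derive_forces(start, [toy_rule])):
            print(f"{name}: {force}")
        print()
```

Applying the rules unconditionally, as this sketch does, is exactly what gives rise to the two difficulties discussed next.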

There are two basic difficulties with the mechanism proposed, both of which are
familiar from Section 5. First, recall Sadock's observation that 5.1 (repeated below) is not an
indirect REQUEST.


5.1 Tell me if you can move over. 


We need some way to block the assignment of three forces -- REQUEST of a tell, ASK, and
REQUEST of a move -- to 5.1. At the same time, it is necessary to preserve the possible
triple-force reading of 5.22. The second difficulty with the proposed mechanism is
illustrated by the examples repeated here:

5.5 Can you please close the door?

5.6 *Are you able to please close the door?

With the mechanism proposed, both 5.5 and 5.6 would be produced in the same way, and
there is no way to account for their differing behavior with respect to please.

To take care of the first problem with the proposed mechanism, we must first amend
the rules given in Section 6. Each rule must specify that the speech act form used must be a
simple one, that is, interrogative for ASK, declarative for the different representatives, and
imperative for REQUEST. For example, Rule 6.3 is now:


Rule 6.3 (for P1-based preconditions):

P1 can convey a speech act indirectly by --

-- a simple form of a representative of the topic of a P1-based precondition of the
speech act.

In addition to this change, the proposed mechanism must be limited to a single application
of the rules for all the ISA forms considered here except for one subclass. The only case
where double indirection, and hence the production of tertiary force, is uniformly possible is
a class that will be called type 3 implicit-action forms.37 This class of ISA forms will be
discussed in Section 7.5.3, and in 7.5.4 consideration is given to the question of why such
double indirection should be possible for this class.
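
The two amendments can be stated compactly in code. The Python sketch below is an assumption-laden illustration, not part of the rules themselves: the names SIMPLE_FORM_MOOD, conveyed_by_simple_form, and max_rule_applications are hypothetical, and the surface mood of an utterance is assumed to be supplied by some earlier analysis.

```python
# Simple form required for each speech act before a Section 6 rule may apply.
SIMPLE_FORM_MOOD = {
    "ASK": "interrogative",
    "REPRESENTATIVE": "declarative",
    "REQUEST": "imperative",
}


def conveyed_by_simple_form(speech_act: str, surface_mood: str) -> bool:
    """True if the speech act is expressed by its own simple form."""
    return SIMPLE_FORM_MOOD.get(speech_act) == surface_mood


def max_rule_applications(isa_class: str) -> int:
    """Double indirection is uniformly allowed only for type 3 implicit-action forms."""
    return 2 if isa_class == "type 3 implicit-action" else 1


if __name__ == "__main__":
    # 5.4 "Are you able to close the door?": the ASK is in its interrogative simple form.
    print(conveyed_by_simple_form("ASK", "interrogative"))
    # 5.1 "Tell me if you can move over.": the ASK sits under an imperative.
    print(conveyed_by_simple_form("ASK", "imperative"))
    print(max_rule_applications("explicit-action"))
    print(max_rule_applications("type 3 implicit-action"))
```

On this reading, 5.1 is blocked because its ASK is not expressed by an interrogative, so no rule may take that ASK as the basis for a further REQUEST force.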

This brings us to the second difficulty with the proposed mechanism. The special
behavior exhibited by 5.5 and 5.6 is one example of the type of phenomenon that the idea
of speech act idioms attempts to account for. Although a relatively small group of ISA
forms display such special behavior, they have received a great deal of attention in the
linguistics literature. Because of a rather interesting combination of characteristics, these
forms have been resistant to attempts to fit them into various theoretical frameworks. After
several different iterations I find that I come close to subscribing to the concept of speech
act idioms. I feel, however, that Searle has made a convincing argument against the use of




37. As ever, there are some borderline cases. Some indirections on secondary forms do seem
to carry three forces, e.g., "I'd like to know if you could drive us to the airport" (an
immediate STATE, a secondary ASK, and a tertiary REQUEST). The examples I have
found tend to be isolated, however, and they will be treated here as frozen forms (see the
next subsection). The only uniformly productive class among those considered in this paper
is the type 3 implicit-action forms.


the term idiom. The next subsection, therefore, describes a slightly different class called
frozen ISA forms.


7.4. Frozen ISA Forms 

I start the explication of frozen ISA forms with the intuition that the sentence in 7.1
can be uttered as a REQUEST -- not an ASK and a REQUEST, but simply a REQUEST.

7.1 Will you close the door?

The first time that I consciously thought about example 7.1, it took a fair amount of effort
for me to get to the literal speech act force (a question about a future action). Not all ISAs
are like this, of course. An utterance such as 5.19, repeated below, seems to be primarily a
representative, and only secondarily a REQUEST.

5.19 It's cold in here.


In searching for the source of this intuition, I come to the fact that the ISA forms that
I want to call "frozen" can or do display special behavior uncharacteristic of the rest of the
surface form. I therefore define frozen forms as follows:38

A frozen ISA form is an ISA form with an immediate force that does not correspond to
the surface form. For such an utterance, one or both of the following obtains:


38. It would be nice if special intonational behavior were associated with indirect
interpretations of utterances, as well. The evidence, however, points to special
disambiguating intonations associated with some direct interpretations of utterances. (See
Sag and Liberman [26].) Although this study reports a strong association between some
intonation contours and indirect interpretations, the intonations did not decisively
disambiguate indirect forms.




1. It can or does display syntactic behavior atypical of the rest of the form.


2. It can or does display cooccurrence behavior atypical of the rest of the form. 


This definition of frozen forms is almost a direct translation of Sadock's criteria for speech
act idioms into the framework that has been developed here.39 (An addition to this
formulation, however, is proposed below.) Clause 2 above is illustrated by the special
cooccurrence properties of example 5.5 as contrasted with example 5.6 (repeated here):

5.5 Can you please close the door?

5.6 *Are you able to please close the door?

An example pointed out by Sadock that fits Clause 1 is 7.2, which is a passive form of 7.1
above.


7.2 Will the door be (*please) closed by you?


The REQUEST force in 7.1 is, intuitively, more direct than that in 7.2. Moreover, the
special cooccurrence properties of 7.1 are lost.

The presence of please in 5.5 is not an isolated phenomenon, as Sadock has pointed
out. Compare:


7.3 Please, will you close the door?
7.4 Will you please close the door?
7.5 Will you close the door, please?
7.6 Please, it's cold in here.

7.7 *It's cold in here please.


39. Note that Sadock's "paraphrase" criterion ([24], chap. 5) has been omitted from the
frozen form definition. While the notion of a paraphrase is useful intuitively, I feel that we
lack a refined understanding of what does, and does not, constitute a paraphrase of an
utterance. The notion of a paraphrase is therefore not yet a computationally useful one.




Even where initial please is possible for secondary REQUESTs (e.g., 7.6), Sadock points out
that its sense is slightly different. This use of please is more likely to appear in an
appeal concerning what someone is either about to do or currently
doing. In addition, for many immediate REQUEST forms, please may occur initially
without a pause, e.g.,

7.8 Please will you close the door?

7.9 *Please it's cold in here.

The non-imperative forms that convey a REQUEST and take please freely include
the indirect forms listed in Figure 7.1. Note, however, that not all cases are as clear-cut as
the ones given. Consider:

7.10 Would you mind please picking up George at the airport? 

7.11 ?Would you mind picking up George at the airport please?

There are a number of these cases for which terminal please seems to be at least marginally
acceptable. This sort of gradual decay of a criterion, as opposed to a sharp cutoff, is a
familiar phenomenon in linguistics. The way that frozen forms have been defined,
however, it is not necessary that the frozen forms for a speech act all display a single type of
special behavior or that, in this case, please occur freely. It is sufficient that the form
display some syntactic or cooccurrence behavior atypical of the rest of the form.

Given the definition of frozen ISA forms, we can amend the simple mechanism
hypothesized above. For each speech act, it appears that the set of frozen forms must be
represented by its members. (This applies to the forms, of course, not to the set of the
individual utterances.) The set of patterns for frozen forms can be used to derive
immediate force, given the surface form of the utterance. For frozen forms, this process
occurs instead of the normal application of the general ISA rules.
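
One way to picture a set of frozen form patterns is as a small table that is consulted before any general rule. The Python sketch below is hypothetical throughout: the regular expressions cover only the REQUEST forms of Figure 7.1, the name immediate_force is introduced here, and the force that corresponds to the surface form is assumed to be supplied by an earlier stage of analysis.

```python
import re
from typing import Optional, Tuple

# Patterns for some frozen REQUEST forms (after Figure 7.1); illustrative only.
FROZEN_REQUEST_PATTERNS = [
    re.compile(r"^(?:will|would|can|could) you (?:please )?(?P<action>.+?)(?:,? please)?\?$",
               re.IGNORECASE),
    re.compile(r"^(?:I want|I would like|I'd like) you to (?P<action>.+?)\.$",
               re.IGNORECASE),
]


def immediate_force(utterance: str, surface_force: str) -> Tuple[str, Optional[str]]:
    """Return (immediate force, requested action or None).

    A matching frozen pattern yields the immediate force directly; otherwise the
    immediate force is the one corresponding to the surface form.
    """
    text = utterance.strip()
    for pattern in FROZEN_REQUEST_PATTERNS:
        match = pattern.match(text)
        if match:
            return ("REQUEST", match.group("action"))
    return (surface_force, None)


if __name__ == "__main__":
    print(immediate_force("Will you close the door?", "ASK"))
    print(immediate_force("Can you please close the door?", "ASK"))
    print(immediate_force("Are you able to close the door?", "ASK"))
```

Because "Are you able to ...?" matches no pattern, it keeps its surface ASK as immediate force, which is the contrast between 5.5 and 5.6 that the patterns are meant to capture.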

This is an improvement, but the proposed mechanism is still not complete. Since 
frozen forms have an immediate force that does not reflect the surface form, I must account
in some way for the response behavior discussed in Section 5.2. It will not do in general to 
say that this response behavior results from processing frozen forms as if they were not 
frozen, eg., as if the REQUEST force for 7.1 were secondary. If this were done, we would 
be at a loss to account for the special surface behavior such as the unexpected presence of 
please in a non-imperative frozen REQUEST form. 

The best answer, I think, lies in assuming that each frozen form pattern has 
associated with it a pointer to the relevant general ISA rule, particularized to a speech act 
appropriate to the surface form. The pointer provides a potential interpretation of the 
utterance, that may or may not be taken. Given this pointer, no information is lost, and 
responses may be keyed off either the immediate force or the general ISA rule.40 Such a
pointer is needed only for frozen forms. In other cases, immediate force corresponds to
surface form, and keying responses off immediate force is equivalent to responding to
surface form. 
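
The pointer can be pictured as one extra field on each frozen form pattern. The following Python sketch is an illustration only: the class FrozenForm, its field names, and the function response_keys are assumptions, and the isa_rule field holds a placeholder label because no particular Section 6 rule is identified here for this pattern.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass(frozen=True)
class FrozenForm:
    pattern: str          # e.g. "Will you <action>?"
    immediate_force: str  # force derived directly from the pattern
    literal_force: str    # speech act appropriate to the surface form
    isa_rule: str         # pointer to the general ISA rule (placeholder label)


def response_keys(form: Optional[FrozenForm], surface_force: str) -> List[str]:
    """Forces that a response may be keyed off.

    For a frozen form both the immediate force and the potential literal force are
    available; otherwise immediate force and surface force coincide.
    """
    if form is None:
        return [surface_force]
    return [form.immediate_force, form.literal_force]


WILL_YOU = FrozenForm(
    pattern="Will you <action>?",
    immediate_force="REQUEST",
    literal_force="ASK",
    isa_rule="<general ISA rule, particularized to ASK>",
)

if __name__ == "__main__":
    print(response_keys(WILL_YOU, "ASK"))
    print(response_keys(None, "ASK"))
```

With both keys available, a reply such as "Yes, I will" can be treated as answering the potential ASK while the hearer still complies with the REQUEST.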

As a final remark on frozen forms, note that they fall somewhere between Sadock’s 
speech act idioms and Searle’s idiomatic but non-idiom forms. It is the addition of the 


40. This treatment of the literal speech act force of a frozen form as only a potential force is 
comparable to Morgan’s notion of short-circuited implicature as it is applied to speech acts 
[21]. Short-circuited implicatures are those whose "literal meaning is ‘latent’ rather than the 
basis for an inference.” The pointer to an ISA rule proposed here for frozen forms appears 
to combine information from the occasion and purpose slots in Morgan's scheme. 




pointer that distinguishes frozen ISA patterns from patterns of idioms. For these "true"
idioms, the literal interpretation of the form is totally lost in normal usage, e.g., "How do you
do?" used in an introduction exchange. For frozen forms, on the other hand, the pointer
makes a literal interpretation accessible, if not always accessed. While frozen forms are not
idioms, they are also not merely idiomatic, because, by virtue of possible "cross-over"
syntactic and/or cooccurrence behavior, they are atypical of the speech act that corresponds
to the surface form.


7.5. Explicit and Implicit Actions 

We turn now to action-centered speech acts and consider the distinction between
actions named explicitly by P1 and those referred to only implicitly. Explicit forms are the
simpler case; example 5.18 ("Close the door.") is a typical explicit form. Implicit-action
forms are relatively complex; at least three different varieties of implicit-action speech
act can be identified. In this subsection, I discuss implicit-action ISAs, taking each of the
three types in turn. Due to the complexity of the subject matter, the account will be fairly
detailed.

Before plunging in, a word must be said about the relationship of implicit-action
ISAs to dialogue context. Implicit-action ISAs rely heavily on context. P1 is obliged to
frame an implicit-action ISA in such a way that P2 can uniquely identify the action, given
the context. P1 must take into account not only the context, but also P1's knowledge of P2's
model of the context. Dialogue context plays a progressively greater role for each of the
three implicit-action types that will be described here.




Despite this heavy reliance on context, it appears that the difference between implicit-action
and explicit-action utterances in this respect is basically quantitative rather than
qualitative. Although more information must be supplied by context for implicit-action
ISAs, the same types of knowledge are needed for both classes of utterance. This knowledge
is needed both to choose the correct interpretation for an utterance (e.g., to distinguish
between direct and indirect speech acts) and to round out the specification of an action. See
Section 12 for a discussion of some of the knowledge sources that come into play for all types
of ISAs, including both implicit- and explicit-action forms.


7.5.1. Must you smoke? 

We start with type 1 implicit-action forms, those for which the action is determinable
from a combination of the choice of indirect utterance form and the action named.
REQUESTs do not seem to have any frozen forms of this type, but there is a pattern with a
secondary REQUEST force, 7.12.

7.12 Must you <action>?

Paraphrases of 7.12, e.g., "Do you have to <action>?" are also type 1. The use of 7.12
indicates that a stopping or avoiding action is desired, and in particular the action to be
stopped or avoided is the one named. For example, "Must you talk so loud?" is a request
for either the implicit action "Stop talking so loud" or "Avoid talking so loud."


Example 7.12 and its paraphrases can be accounted for by the following rule:


Rule 7.1:41
Pl may ASK about PI's poner to sttoain an action 
-- instead of --
making a REQUEST by ASKing whether P2 can stop or avoid the action.
(Rule 6.5 acting on REQUEST precondition I.)

Rule 7.1, and the other rules that will be given for implicit actions, represents a link between
an ISA pattern (the first half) and either a simple form or a pattern produced by one of the
general rules from Section 6 (the second half).42

Example 7.12 and its paraphrases are the only type 1 implicit-action REQUESTs that 
I have found, but we have the SUGGEST forms “Why not <action>?” and "Why <action>?” 
For these forms, the suggestion is "do <action>" and "stop or avoid <action>", respectively.

The important feature of Rule 7.1 and the other rules in this class is that the choices
of intended action (stopping or avoiding the action) are listed explicitly in the rule. Context
plays a role in determining which of the two was intended by P1, but the operation for
determining the implicit action is a choice, not an open-ended search.
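
The point that type 1 recognition is a choice rather than a search can be made concrete with a short Python sketch. All names here are hypothetical, and the way context selects between the two candidates (checking whether the named action is currently in progress) is an assumption made for the sake of the example; the text above says only that context plays a role.

```python
import re
from typing import List, Optional, Set, Tuple

# Type 1 REQUEST patterns: 7.12 and one of its paraphrases.
TYPE1_REQUEST_PATTERNS = [
    re.compile(r"^must you (?P<action>.+)\?$", re.IGNORECASE),
    re.compile(r"^do you have to (?P<action>.+)\?$", re.IGNORECASE),
]


def type1_candidates(utterance: str) -> List[Tuple[str, str]]:
    """Return the closed list of candidate implicit actions, or [] if no pattern matches."""
    text = utterance.strip()
    for pattern in TYPE1_REQUEST_PATTERNS:
        match = pattern.match(text)
        if match:
            action = match.group("action")
            # The candidates are listed in the rule itself: stop or avoid the named action.
            return [("stop", action), ("avoid", action)]
    return []


def choose_implicit_action(utterance: str, ongoing_actions: Set[str]) -> Optional[Tuple[str, str]]:
    """Pick one candidate; assumes 'stop' is meant when the action is already under way."""
    candidates = type1_candidates(utterance)
    if not candidates:
        return None
    stop, avoid = candidates
    return stop if stop[1] in ongoing_actions else avoid


if __name__ == "__main__":
    print(type1_candidates("Must you talk so loud?"))
    print(choose_implicit_action("Must you talk so loud?", {"talk so loud"}))
    print(choose_implicit_action("Do you have to smoke?", set()))
```

However the choice is made, the candidate list never grows beyond the two entries supplied by the rule.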


7.5.2. Can you reach the salt?

For the next implicit-action form, type 2, P1 has some action in mind but the implicit
action is not determinable from surface form alone. Type 2 implicit-action forms name


41. Earlier versions of Rule 7.1 and the other rules for implicit-action forms explicitly stated
that the action intended by P1 should be clear to P2 from context. I now believe, however,
that this condition is part of the more general constraint to avoid ambiguity, which applies
to direct as well as indirect speech acts. (See Section 6.) It is therefore not necessary to
restate the principle in each implicit-action rule.


42. Note that Rule 7.1 is explicitly tied to REQUESTs. It is not clear that implicit-action
rules need to be tied so closely to individual speech acts. I have found some rules shared by 
pairs of action-centered speech acts, but further investigation in this area is needed. 


some component of the implicit action, and the implicit action is determinable from 
knowledge of the structure of actions plus knowledge about the context of the utterance. 
Context is necessary because the action component named may be associated with more than 
one action.

More specifically, for a definition of component I draw on the model of actions
introduced in Section 4. A component is either a state that is a prerequisite of the implicit
action, a PRINCIPAL-RESULT of the action, one of the action's semantic input cases, or a
subaction.43 In this subsection, we examine rules based on each of the four types of
component.

We start with a type 2 rule based on prerequisites:

Rule 7.2:
P1 may ASK about or utter a representative about a stative prerequisite of an action
where P2 is in some way a participant in the state
-- instead of --
making a REQUEST by ASKing about or uttering a representative about whether
P2 can take responsibility for carrying out the action.
(Rules 6.5 and 6.6 acting on REQUEST precondition 11.)


Rule 7.2 can be illustrated by an example of Searle's, the use of 7.13 to convey a REQUEST.

7.13 Can you reach the salt?

Here, the implicit action is that P2 pass the salt (either to P1 or to someone else clearly
identifiable from the situational context). Instead of naming the action explicitly, P1 names


43. The subactions that can be used in type 2 implicit-action forms seem to be restricted to
standard path steps (as defined in Section 4).




a stative prerequisite of the action. Note that naming a stative prerequisite does not always
imply the action. One may ask "Is the salt near you?" or "Can you reach the salt?" for "Can
you pass the salt?" but 7.14 or 7.15 do not imply 7.16 or 7.17 in response to a request to
describe the dinner:


7.14 The salt was near Harry.

7.15 Harry was able to reach the salt.

7.16 Harry passed the salt.

7.17 Harry was able to pass the salt.

7.18 I asked for the salt and Harry was able to reach it.


Example 7.18 does imply that Harry passed the salt, but only because the speech act context
was reported. Thus, the action implied is a property of the speech act context
and not a property of the nature of stative prerequisites alone.

Turning from prerequisites, the next component type is the PRINCIPAL-RESULT.
Two rules are of interest here:


Rule 7.3:
P1 may utter a representative about a desire or need44
for a state or object that is the PRINCIPAL-RESULT of some action, where the
desire or need is PI's or occurs for someone PI is empoee to len for, 
-- instead of --
making a REQUEST by uttering a representative about a comparable desire or need
for P2 to take responsibility for carrying out the action.
(Rule 6.3 acting on REQUEST precondition 1.)


Rule 7.4:
P1 may ASK whether the PRINCIPAL-RESULT of a desired action obtains
-- instead of --
making a REQUEST by ASKing whether the desired action has been done.
(Rule 6.1, Clause iv)


44. Desire and need together are equivalent to want as I have been using it. I used
"desire or need" here to make the rule more readable, but the phrase can be replaced by
want without changing the sense of the rule.


An utterance related to Rule 7.3 is 7.19, which has an immediate REQUEST force.

7.19 I want this room clean when I come back, please.


This example illustrates a property of those implicit-action forms that involve a
PRINCIPAL-RESULT. In uttering 7.19, P1 may or may not have a particular action in
mind. On the one hand, P1 may want P2 to call the cleaning service, a particular action.
On the other hand, P1 may wish to convey only a desire that P2 find some way to achieve
the general goal "get the room clean." In this second case, P1 leaves the choice of a
particular action to P2. Note that I will still consider this second use of 7.19 to have a single
implicit action (e.g., "get the room clean"), keeping in mind, however, that the action is
specified only in a general way.

Moving on to the next PRINCIPAL-RESULT rule, Rule 7.4 is similar to 7.3,
producing REQUESTs such as 7.20. An explicit-action ISA that corresponds to 7.20 is 7.21,
which is based on Rule 6.1, Clause (iv).

7.20 Do you have the letter?

7.21 Did you pick up the letter?

The component relationship should be clear in this pair of examples: having a physical 
object is the PRINCIPAL-RESULT of a pick-up-physical-object method. In turn, the 
picking up action done is the principal intended effect of an unuttered REQUEST. 

We come now to the third componential relationship, semantic input cases of actions
named. An example utterance is 7.22, spoken as a REQUEST that P2 light P1's cigarette.45


45. If, on the other hand, 7.22 is uttered as a REQUEST to give P1 a match (or matches)
then I would consider the relationship involved to be prerequisite and the utterance to be
derivable from Rule 7.2.


7.22 Do you have a match? 


Here, matches are one possible INSTRUMENT in a light-cigarette method. Thus, we get
the following rule:

Rule 7.5:
P1 may ASK whether P2 has <item that fills an INSTRUMENT slot>
-- instead of --
making a REQUEST by ASKing whether P2 has <item that fills an INSTRUMENT
slot> for/to <action>
(Rule 6.10 acting on the INSTRUMENT case)

It is not clear to me what other semantic cases may appear in type 2 implicit-action forms.
Certainly, type 2 ISAs based on semantic input cases are similar to the explicit-action forms
discussed in Section 6.3.2. It appears, however, that type 2 implicit-actions are derived from
a more limited set of semantic cases. This is a question that I leave to further research.

This brings us to the fourth, and final, type of component: subactions. An example is
7.23, which in the proper context REQUESTs not only a transfer but also a creation action
(i.e., write the program).

7.23 Give me a program that builds an arch.


Creation actions bring a new object, either physical or mental, into existence, and creation 
actions performed for someone else have a transfer step near the end of the standard 
path(s). In this transfer step, the AGENT of the action transfers control of the object that is 
in the PRINCIPAL-RESULT to the BENEFICIARY. This transfer subaction may be
named instead of the creation action in a type 2 implicit-action REQUEST.




To summarize what has been said about type 2 forms, we can say that these forms are 
recognizable with reference to general knowledge about the components of different actions. 
Components are the prerequisites, PRINCIPAL-RESULTs, semantic input cases, and 
subactions described in Section 4. Type 2 implicit-action rules include some that apply to 


actions in general and some that apply to specific classes of actions. 
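
The role of components in recognizing type 2 forms can be sketched as a lookup from a named component to the actions that contain it. The Python below is a toy illustration only: the Action class is a drastically simplified stand-in for the model of actions in Section 4, the component strings are hypothetical, and the second salt action is invented solely to show why context is still needed.

```python
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass
class Action:
    name: str
    prerequisites: List[str] = field(default_factory=list)
    principal_result: str = ""
    input_cases: Dict[str, str] = field(default_factory=dict)
    subactions: List[str] = field(default_factory=list)

    def components(self) -> List[str]:
        """All four kinds of component, flattened into one list."""
        parts = list(self.prerequisites)
        parts += list(self.input_cases.values())
        parts += list(self.subactions)
        if self.principal_result:
            parts.append(self.principal_result)
        return parts


def candidate_actions(component: str, actions: List[Action]) -> List[str]:
    """Names of the actions that have the named component; context must pick among them."""
    return [a.name for a in actions if component in a.components()]


# Toy knowledge base built from the examples in this subsection.
ACTIONS = [
    Action("pass the salt", prerequisites=["P2 can reach the salt"]),
    Action("put the salt away", prerequisites=["P2 can reach the salt"]),
    Action("pick up the letter", principal_result="P2 has the letter"),
    Action("light the cigarette", input_cases={"INSTRUMENT": "match"}),
    Action("write the program", subactions=["give the program to the beneficiary"]),
]

if __name__ == "__main__":
    print(candidate_actions("P2 can reach the salt", ACTIONS))
    print(candidate_actions("P2 has the letter", ACTIONS))
    print(candidate_actions("match", ACTIONS))
```

When a component belongs to more than one action, as with the salt prerequisite in this toy table, the lookup returns several candidates and the choice falls to knowledge about the context of the utterance.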


7.5.3. It's cold in here. 

Finally, we come to type 3 implicit-action forms. Type 3 forms are those in which a 
state or action named constitutes a basis for the implicit action. The notion of a basis for 
action will be discussed further below. A standard example of type 3 forms found in the 


literature is 5.19, repeated below, used as a REQUEST to close the door. 
5.19 It's cold in here.


The rule that characterizes type 3 implicit-action forms for REQUESTs can be stated
informally as follows:

Rule 7.6:
P1 may utter a representative about a basis for action where
P1 wants P2 to take responsibility for carrying out an action that remedies an
undesirable state46
(This state constitutes the basis for action or has a special relationship to it. More
specifically, the state is "S1" in the definition of a basis for action, below.)


46. In uttering a REQUEST derived from Rule 7.6, P1 reflects a desire to remedy an
undesirable state. Thus, the negation of the undesirable state is a PRINCIPAL-RESULT
desired by P1. In the last subsection, I said that implicit-action forms
involving a PRINCIPAL-RESULT could be uttered with or without a particular action
in mind. We see this quite clearly for utterances derived from Rule 7.6, as evidenced by
the following common sort of ambiguity: P1: "It's cold in here." P2: "What do you want
me to do about it?" P1: "I don't care. Just do something."




-- instead of --
making a REQUEST by uttering a representative about P1's desire or need for the
action.
(Rule 6.3 acting on REQUEST precondition 1.)


A basis for action may be defined as follows: 


i. a state (S1) that is undesirable
   either:
   because it is in conflict with some goal in the context (from the local context to the
   very general "social context")
   or
   because when S1 is combined with other states in the context it results in a state
   S2, where S2 is in conflict with some goal, as above

ii. a symptom of S1
   Here, a symptom is a state S3 that typically coexists with S1, where when S3 occurs one
   is warranted to check whether S1 obtains.

iii. a signal for S1
   A signal is the communication step of a process that is set up to monitor something
   else -- e.g., another process, a physical location, etc. -- for the existence of some state.

This is quite a bit of material, so some examples may help to clarify the definition. First,
example 5.19 above can be thought of as reporting a state that is in direct conflict with the 
goal of maintaining a comfortable temperature (part (i) of the definition). This type of 


basis for action can be used as an answer to a why-question asked by a non-participant in 




the speech act, eg. 7.24.47 
7.24 P1: Why did you close the door?
P2: Because it was cold in here. 
An example of a symptom used as a basis for action is the following: 


7.25 The cake is beginning to smell. 


A cake smelling is a symptom of its being done. The fact that the cake is in the oven (a
contextual state) combined with the fact that the cake is done can result in a burned-cake
state, which is generally in conflict with the goal of baking a cake.48 Two remarks can be
made here. First, while this sort of analysis of states can always be done, at least in many
cases I do not believe that it must be done in order to recognize an ISA. I will expand on
this point below. Second, note that a symptom need not name an unpleasant situation, as
7.25 illustrates.


Finally, we come to signals, an example of which is 7.26. 


7.26 The buzzer went off. 


The oven buzzer could be a signal for the fact that the cake is done, and, by a chain of
states similar to that for 7.25, this state could lead to a conflict with the goal of having an
edible cake.
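
The three-part definition can be made concrete with a small sketch. The Python fragment below is illustrative only and is not part of the original analysis: the names `BasisForAction` and `conflicting_goal`, the state labels, and the two toy tables are all assumptions introduced here. It traces examples 5.19, 7.25, and 7.26 from the reported state to S1 and, where needed, through a combined state S2 to the goal that is threatened.

```python
from dataclasses import dataclass
from typing import Optional, Set

# Hypothetical toy knowledge: which goal a state conflicts with, and which
# pairs (S1, contextual state) combine to produce a further state S2.
GOAL_CONFLICTS = {"cold-room": "comfortable-temperature",
                  "burned-cake": "edible-cake"}
COMBINATIONS = {("cake-done", "cake-in-oven"): "burned-cake"}


@dataclass(frozen=True)
class BasisForAction:
    kind: str        # "state", "symptom", or "signal"
    reported: str    # what the representative literally reports
    s1: str          # the state S1 that the report points to


def conflicting_goal(basis: BasisForAction, context: Set[str]) -> Optional[str]:
    """Return the goal that S1 conflicts with, directly or via some S2."""
    if basis.s1 in GOAL_CONFLICTS:                  # first half of part (i)
        return GOAL_CONFLICTS[basis.s1]
    for other in context:                           # second half of part (i)
        s2 = COMBINATIONS.get((basis.s1, other))
        if s2 in GOAL_CONFLICTS:
            return GOAL_CONFLICTS[s2]
    return None


cold = BasisForAction("state", "cold-room", "cold-room")            # 5.19
smell = BasisForAction("symptom", "cake-smells", "cake-done")       # 7.25
buzzer = BasisForAction("signal", "buzzer-went-off", "cake-done")   # 7.26
```

With the contextual state `"cake-in-oven"` supplied, both `smell` and `buzzer` lead to the same threatened goal; without it, no conflict is found. The sketch only spells out the chain of states; as remarked above, a recognizer need not always carry out this analysis in order to recognize an ISA.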


47. I specify non-participant here because I suspect that, if the question in 7.24 is asked by
the participant making the REQUEST, then the preferred answer is something like
"Because you told me to."


48. This combination of states exemplifies the second half of part (i) of the basis for action 
definition. 




There is still more to be said about the notion of a basis for action. In particular,
there is the question of how a basis for action can be related to the general structure of
actions developed in Section 4. Since the answer to this question involves quite a bit of
additional detail, it appears in Appendix D. This permits us to move directly to a
discussion of the special properties of type 3 forms.


7.5.4. Special Properties of Type 3 Implicit-Action Forms 

In Section 7.3 I observed that the type 3 implicit-action class constitutes an exception
to the single indirection limit. In addition to this, there is another property displayed by
type 3 forms. Compare:

7.27 Get me a cup of coffee, and I'd like something to eat, please.

7.28 Get me a cup of coffee, and do you have any pie?

7.29 *Get me a cup of coffee, and I'm hungry.

Example 7.27 shows a frozen REQUEST form conjoined with a simple form, and example
7.28 contains a type 2 implicit-action form accounted for by Rule 7.2. (Not all type 2 implicit-
action forms are as acceptable as 7.28, but all that I have looked at are at least marginally
acceptable when conjoined in this way.) Finally, 7.29 shows a type 3 implicit-action
REQUEST that cannot be successfully conjoined with the simple form. It appears that type
3 implicit-action REQUESTs are the only sort of REQUEST forms that may not be
conjoined with simple forms.

The uniform possibility of double indirection and the special conjunction behavior
support the treatment of type 3 forms as a separate class. We still, however, need an
explanation of why such behavior should be exhibited by type 3 forms at all. While I
cannot answer this question in detail, I suggest an account based on the degree of
independence of the speech act forces of an utterance.

For type 3 forms, the message conveyed by the representative speech act force is
relatively independent of the action-centered speech act force. The immediate force of a
type 2 implicit-action form, for example, names a component of the implicit action, which is a
close relationship. In contrast, the "basis for action" relationship in type 3 forms is one in
which the state or action named is more separate and on a more equal footing with the
implicit action (see Appendix D).

To the extent that a speech act force is independent it can be seen as carrying its own
separate message. The message conveyed by a representative force of a type 3 ISA is
apparently separate enough that it merits the possibility of the additional modulation
afforded by indirection. Similarly, the representative message of a type 3 form is
independent enough of the other force(s) that, when conjoined with a type 3 action-centered
form, the representative force competes with the action-centered force for a place in the
semantic interpretation of the conjunction. (The difficulty here is that simple representatives
do not normally conjoin with simple action-centered speech acts.)

Type 3 forms, then, exhibit special surface behavior. I suggest an explanation of this
behavior based on semantic properties used to define ISA classes. The explanation appeals
to a notion of the degree of independence between the speech act forces of an utterance.




7.6. Summary: Implications for the General ISA Rules 

The categories given divide utterance forms into direct, indirect, and tag. Utterances
are said to be single-, double-, or triple-force, and a given speech act may appear as the
immediate, secondary, or tertiary force of the utterance. Frozen ISA forms are those forms
whose immediate force differs from the speech act in a literal interpretation of the surface
form of the utterance. REQUEST forms (and those of other action-centered speech acts)
specify actions either explicitly or implicitly, and implicit-action forms are further classified
into three types. Implicit-action forms are distinguished according to whether the intended
action is derivable using surface form (type 1), whether it is named by a component of a
method associated with the action (type 2), or whether a "basis for action" is appealed to
(type 3).

The impact of this taxonomy on the general ISA rules (Rules 6.1 to 6.10 from Section
6) can be summarized as follows:


Additions 
1. The general ISA rules are augmented by implicit-action rules as described in Section 7.5.
2. The general ISA rules are augmented by a set of frozen form patterns as described in 


Section 7.4. 


Limitations 
1. The general ISA rules are amended to permit only simple speech act forms as discussed in 
Section 7.3. 


2. The general ISA rules are applied once only, except as noted below. 




Production of Forces

Immediate Force
Frozen ISA Forms: produced by a combination of implicit-action rules as necessary
and the patterns for frozen forms
Others: directly derivable from a combination of implicit-action rules as necessary and
the surface form of the utterance

Secondary Force
All ISAs other than frozen forms have a secondary force derivable by a combination
of implicit-action rules as necessary plus one application of the general ISA rules.

Tertiary Force
Of the forms examined here, only type 3 implicit-action forms (uniformly) may carry a
tertiary force. This force is derivable by a combination of the type 3 implicit-action
rules plus two applications of the general ISA rules.
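
The production summary can be restated compactly as a lookup. The sketch below is an illustration under assumptions: the category labels `"frozen"`, `"type3"`, and `"other_isa"` and the function name `general_rule_applications` are introduced here, and the code records only how many applications of the general ISA rules each force requires. Implicit-action rules and frozen form patterns, which are applied "as necessary", are not counted.

```python
from typing import Dict

CATEGORIES = ("frozen", "type3", "other_isa")   # hypothetical labels


def general_rule_applications(category: str) -> Dict[str, int]:
    """Map each force an ISA form may carry to the number of applications
    of the general ISA rules needed to derive that force."""
    if category not in CATEGORIES:
        raise ValueError(f"unknown category: {category}")
    forces = {"immediate": 0}       # from patterns or surface form alone
    if category != "frozen":
        forces["secondary"] = 1     # one application of the general rules
    if category == "type3":
        forces["tertiary"] = 2      # two applications, type 3 forms only
    return forces
```

Read this way, the single indirection limit appears as the cap of one application for every class except type 3.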




PART 2 


COMPUTATIONAL IMPLICATIONS 




8. Computational Perspectives on ISAs 


8.1. Drawing Computational Implications from the ISA Categorization 


The first part of this paper developed a characterization of indirect speech acts and
justified it with traditional (non-computational) sorts of linguistic arguments. Now, in Part
2, some of the computational implications of this ISA categorization will be considered. The
main point of Part 2 is that the distinctions made in Part 1 between ISA types occur at
precisely the places where computational processes should be distinguished.

The computational implications that I wish to explore will be presented in the context
of a process model for ISA recognition.49 That is to say, I will be concerned with relating
utterances to an appropriate underlying representation. The following two questions will,
together, provide a focus for Part 2:

1. Given an utterance form, how can one represent, in a computationally useful way,
the possible speech acts it can realize?

2. How can the system for characterizing ISAs from Part 1 be related to a view of
recognition as essentially a path-building process? The paths in question are links
between the speech acts conveyed by an utterance and nodes on a tree representing
the events of the dialogue. (This view will be developed in Section 9.1.)


49. I use the word recognition rather than understanding to emphasize the importance that
expectations will play in the process model; see Section 1. Note that only recognition will be
considered here. I see generation as at once an easier and a more difficult problem. On the
one hand, the speaker is spared some of the problems of ambiguity and incomplete
knowledge about what is said, making the generation process simpler. On the other hand,
the generation process is complicated by a need to take the recognition process of the hearer
into account, to avoid ambiguity when this is necessary or desirable.




For the answer to the first question, I draw on the distinctions from Part 1.
Depending on category, the model will use explicit lists of realization patterns or rules at
various levels of generality.

In answering Question 2, I will be concerned with the type and number of paths to be
constructed for each utterance form. The immediate/secondary/tertiary distinction from
Section 7.3 is seen as directly affecting this question, with one path built for each force.
Another factor here is the direction in which paths are built; the choice of direction has
an impact on the representation of ISA forms.

Although these questions were stated separately, they are actually strongly interrelated.
Accordingly, after a general discussion of recognition in Section 9, I discuss the questions
together in Sections 10 and 11 and go on to look at some aspects of the role of context in
Section 12. Note that the answers to the two questions are not intended to be complete. In
no sense am I attempting to present a full computational model of ISA recognition. I have
concentrated on what I see as the more important computational implications of the theory
presented in Part 1, with the hope that the ISA recognition scheme outlined in Part 2 will
eventually form the basis of a more complete computational model.

Every effort has been made to make the discussion of ISA recognition
implementation-independent. It is worth mentioning, however, that parts of the model have
been implemented in a system described in [3]. This system, called Susie Software, was
designed as a prototype dialogue module for an automatic programming system. While the
system's debugged behavior was limited to a twenty line dialogue (plus some close variants),
it is fair to say that the implementation did nothing to disconfirm the model presented here.
The implementation process did, however, reflect negatively on some earlier versions of the
model, and in that sense it was an important exercise.


8.2. A Look at Related Work in Dialogue Modeling 

Before plunging in, it may be useful to see where the approach to dialogue (and ISA) 
recognition that I am advocating fits in with other recent work on dialogue. I start with the 
OWL-I representation of actions discussed in Section 4. 

OWL-I methods represent chunks of knowledge in the tradition of Minsky's frames
[19] and Schank and Abelson's scripts [27], and as such they should be familiar types of
structures. Both of these approaches are geared to representing common sense knowledge,
frames primarily from the point of view of visual knowledge and scripts from the point of
view of social actions. Although Minsky does discuss the application of frames to language
understanding, he does not address the special problems posed by dialogue, where language
generation and recognition are intimately related. Schank and Abelson's focus on social
actions brings them closer to problems of dialogue, but their representational scheme is quite
different from the one adopted here. Scripts are built from a small set of primitive actions,
in contrast to the open-ended set of actions used in the OWL-I methods. Despite the
differences, the three approaches have one very important point in common. All three focus
on chunks of knowledge whose interrelationships are used to form expectations to guide
recognition processing. This idea plays a central role in the framework described here.

Of the specifically language-related research, four efforts are especially relevant here.


The first is work by Grosz [812], who focussed attention on the importance of task 


knowledge for dialogue. A semantic network was used to represent the structure of tasks
such as assembling an air compressor, and dialogue utterances were interpreted with respect
to this task knowledge. Using this approach, Grosz has been able to develop important
insights on reference and pronominalization.

Although Grosz's work shows how task knowledge affects dialogue, there is no
explicit model of more general dialogue knowledge. One early effort to represent both
linguistic and non-linguistic actions uniformly was that of Bruce [5]. The OWL-I dialogue
method is quite close to Bruce's social action paradigm, in that both are centered on
semantic cases and both were developed with an eye to incorporating speech acts. The
primary representational difference is the distinction made in OWL-I methods between
standard and recovery paths (Section 4), which has important implications for processing as
well as for representation.

Bruce's approach, then, points the way to an integration of linguistic and non-linguistic
knowledge in a single representation scheme. There is still the problem, however,
of showing how such general representations apply to particular situations. (In [5], Bruce
discusses the use of social action paradigms for story understanding but not for dialogue.)

A group that has worked on a mechanism for relating representations to individual
dialogues is Moore, Levin, and Mann (see [20]). Their work is specifically dialogue-related,
and their representations for what they call dialogue games are spiritual cousins to Bruce's 
social action paradigms and to OWL-I dialogue methods. The major difference is in the 
approach to control structure; general dialogue game structures are related to particular 


dialogues by a group of processors that work independently and in parallel [16]. In contrast, 




the design that will be discussed in Section 9 and Section 11 has a highly centralized control
structure where every effort has been made to identify limited sets of information necessary
for each choice. This means that an attempt has been made to limit searches and match
attempts by structuring information so that the system knows where to look for its
information. To an extent, this view is influenced by the current serial machine
architectures, but I suspect that the more structure we can identify in knowledge -- both
linguistic and non-linguistic -- the more successful will be our models.

Another group that has been concerned with both representation and mechanism for
dialogue is Cohen, Perrault, and Allen (Cohen [6], Perrault et al. [22]). They have been
interested in specifying a general, inference-based mechanism for choosing and recognizing
speech acts, given the facts of a situation and the goals of the speaker. Speech act
generation is viewed as a general planning problem, and recognition is viewed as a plan-
recognition problem. This approach differs from more strongly expectation-based
approaches, which include the other efforts described here as well as the one to be
presented.

Within this plan-recognition framework, the work that is of most relevance for us 
here is that of Allen [1], who concentrates on ISA recognition. Of the three theories 
discussed in Section 5, Allen's framework is closest to that of Searle. The model that Allen 
presents uses a general inference mechanism to identify speech act forces beyond what I am 
calling here the immediate force. The inference mechanism uses knowledge about the 
structure of actions, as well as knowledge about the planning process. As noted above in 


Section 6.5, such a general approach is probably necessary for handling some types of ISAs. 




My claim, however, is that, for a large class of ISAs, there is knowledge that can be
exploited to avoid deriving a form "from scratch" each time it is recognized. The design
discussed here relies heavily on predigested information associated with ISA patterns. This
allows more aggregated operations, so that one operation may correspond to a segment
containing several inferences in Allen's system.

To sum up, the approach that will be taken gives a central position to expectations in
the recognition process. It stresses the importance of task knowledge in forming these
expectations, integrating this knowledge with linguistic knowledge in a uniform
representation. Efforts have been made to consolidate control structure where possible, and
an attempt has been made to tailor operations to different categories of ISAs, in order to
exploit knowledge that we have about them.




9. A Framework for Recognition Processing 


The recognition process for ISAs cannot be discussed outside of a general model of
recognition. In Section 9.1 I briefly discuss the system configuration I am presupposing, and
in Section 9.2 I go on to recast recognition as a path-building process.


9.1. A System Configuration for Recognition 

In this section, I discuss only those aspects of recognition that are required for the
discussion of ISA recognition. Figure 9.1 shows the recognition processing configuration
that the ISA model assumes.50 Many aspects of the configuration in Figure 9.1 are
influenced by the wish to model dialogue as, essentially, the execution of OWL-I methods.
This means that a knowledge base is needed to hold the method library (among other
information), a knowledge base processor is needed to maintain it, and a method interpreter
is needed to move from step to step and invoke and terminate methods.


Knowledge Base and Knowledge Base Maintenance 


The Knowledge Base contains concepts, the basic unit of OWL-I, and structures built 


out of these concepts. These structures include OWL-I methods to represent processing 
knowledge as well as descriptive structures for classes of static concepts (eg. substantives 


such as toy blocks or computer consoles). The Knowledge Base Maintenance facility needs


50. Figure 9.1 has many points in common with the OWL-I implementation done at the
M.I.T. Laboratory for Computer Science by William A. Martin, Lowell Hawkinson, William
Long, William Swartout, Alexander Sunguroff, and the author. That system contained
running examples of everything in 9.1 except the English Front End.




USER

ENGLISH FRONT END

REFERENCE MATCHER

METHOD INTERPRETER
(CARRY-OUT | RECOGNIZE | ASSUME)
EVALUATE

KNOWLEDGE BASE MAINTENANCE

EVENT TREE

KNOWLEDGE BASE


Figure 9.1. The recognition system assumed




the ability to add new concepts and structures to the Knowledge Base and to modify old 


ones. 


Interpreter 

It is the Interpreter’s job to “execute” method steps to generate and recognize 
utterances in the dialogue. If request-and-respond, ask-and-answer and the other dialogue 
methods are speaker independent, then it is up to the interpretation process to determine 
whether a particular utterance is to be generated or recognized. There are, in fact, three 
possible modes of interpretation (see footnote 51) for a step in a dialogue method:

1. Carry out the step (e.g., ask a question). In dialogue, the most important role of 

Carry-out is to generate utterances, but it is also used to execute “mental” actions. 


2. Recognize that a step has happened (e.g., that an answer to your question has been 
given). 


3. Assume that a step has happened (eg., if your conversational partner gave the 
answer, then he or she had to perform the mental process of finding the answer first.) 


A set of simple rules is adequate in most cases to determine the mode of a step from the 
semantic input case settings in the procedure call.52 
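
The rules themselves are not spelled out here, so the following is only a guess at their general shape. It assumes, hypothetically, that each step records its agent case and whether it is a "mental" (unobservable) action, and that the mode follows from comparing the agent with the participant whose point of view is being modeled. The names `Step` and `choose_mode` are introduced for this illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Step:
    name: str
    agent: str      # semantic input case: who performs the step
    mental: bool    # True if the step has no observable utterance or action


def choose_mode(step: Step, me: str) -> str:
    """Pick an interpretation mode for `step` from the point of view of `me`."""
    if step.agent == me:
        return "carry-out"    # my own step: generate or execute it
    if step.mental:
        return "assume"       # partner's unobservable step: take it as done
    return "recognize"        # partner's observable step: match it to input


ask = Step("ASK a question", agent="P1", mental=False)
find = Step("find the answer", agent="P2", mental=True)
state = Step("STATE the answer", agent="P2", mental=False)
```

From P1's point of view, `ask` is carried out, `find` is assumed, and `state` is recognized. Ambiguous settings of the kind mentioned in footnote 52 fall outside such a simple rule set.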


In order to introduce other system modules, I will expand on the Interpreter's Carry-out
mode. (Recognition mode is discussed further in the next subsection.) Stative steps (e.g.,
results) are handled in Carry-out mode by simply asserting them into the Knowledge Base.
The handling of non-stative method steps is more complex. Given a non-stative step,
Carry-out first evaluates it with respect to the current environment. (Evaluate is discussed
below.) This evaluated step is then treated as a procedure call and matched against known
methods to find appropriate subprocedures. Restrictions on what can fill each input case
associated with a method are used in this matching process. The use of the evaluated call
means that the choice of a method is heavily influenced by the progress of the dialogue to
date.


51. Throughout Part 2 of this paper, the words interpret and interpretation will be used to
indicate the interpretation of methods. The meaning of interpret found in natural language
interpretation (as opposed to natural language generation) will be conveyed by recognize.


52. Semantic input cases were discussed in Section 4. In some conversational settings it is
ambiguous whether, say, recognition or merely assumption should occur next. "Constructed"
dialogue environments can avoid this problem by careful design. In other dialogue
environments, clarifying discussion can often be used to determine the correct mode in an
ambiguous situation.


Evaluate 

OWL-I forms are a combination of regular concepts and variables. The module
Evaluate takes these forms and looks up and binds appropriate values of the variables.
The evaluation process is always done with respect to some environment, for example when
a call is evaluated with respect to the current dialogue environment before the search for a
method to carry it out.
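
As a rough illustration of what Evaluate does, the sketch below binds variables in a nested form against an environment. The representation is an assumption made for this example only: forms are nested Python tuples, variables are strings beginning with "?", and the environment is a dictionary. None of this is OWL-I notation.

```python
def evaluate(form, env):
    """Return a copy of `form` with each bound variable replaced by its value.
    Variables that have no binding in `env` are left in place."""
    if isinstance(form, tuple):
        return tuple(evaluate(part, env) for part in form)
    if isinstance(form, str) and form.startswith("?"):
        return env.get(form, form)
    return form


# A hypothetical call, evaluated with respect to a dialogue environment.
call = ("STATE", ("ANSWER-TO", "?question"), ("AGENT", "?speaker"))
env = {"?question": "QUESTION-1", "?speaker": "P2"}
bound = evaluate(call, env)
```

The evaluated call `bound` is the kind of object that would then be matched against known methods, which is why the choice of method depends on the dialogue so far.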


English Front End and Reference Matcher 

The English Front End consists of morphological and parsing modules for typed
input and includes additional signal processing modules for spoken input. The Reference
Matcher is not limited to resolving noun group references; instead, the entire utterance is
seen as referring to semantic structures, and the task of the Reference Matcher is to identify
these concepts. This identification process can either be done by composing new
representations, by verifying that the utterance matches an expected representation, or by
some combination of the two. Note that the arrows in Figures 9.1 and 9.2 are intended to
mark the dominant flow of control. Some interaction with adjacent modules is expected for
both the Front End and Reference Matcher.

For our purposes here, it is the output of the Front End and the Reference Matcher,
rather than their internal operations, which is of interest. The representation that we need
from the Front End will be called surface semantic representation, so named because its
elements are semantic structures but its form corresponds closely to the surface English of
the utterance. The Reference Matcher takes this representation as input and produces as
output Interpreter level representation, the semantic language used by the Interpreter.
Figure 9.2 illustrates the sorts of representations that I am assuming. The lexical items
correspond to OWL-I concepts. Parentheses are used to establish grouping relationships;
they have no relation at all to LISP parentheses. The primary difference between the
representations in Figure 9.2 is that, where possible, referent identification and semantic case
assignments have been made in the Interpreter level representation. Further details of the
representations assumed are given in Appendix E.


Event Tree

The Event Tree is built by the Interpreter as a record of methods executed in the
course of a dialogue. In this regard, the Event Tree can be thought of as intermediate term
memory, used to record the current dialogue and organized chronologically. Past events are
not removed from the tree, so that they are available for inspection, question answering,
resumption (in the case of uncompleted events), etc.




Can you pass the salt?

Surface Semantic Representation
(SAY INTERROGATIVE)
(CAN
    ((PASS (SALT THE))
        (SUBJECT YOU))
    (SUBJECT YOU))

Interpreter Level Representation
(ASK
    (WHETHER
        (CAN
            (PASS SALT-1)
                AGENT: P2
                DESTINATION: P1]
            AGENT: P2]
    AGENT: P1
    DESTINATION: P2]

Figure 9.2. Example representations for an utterance that conveys (only)
an ASK.

It is to the branches of the Event Tree that I am referring when I speak of
recognition as a "path-building" process. The branches of the Event Tree correspond to
substeps of methods. Figure 9.3 is a very simple example of an Event Tree configuration
for a computer console session environment in which a question is asked by the user and
answered by the system. Note that each node on a branch either has a subpart relationship
with its superior (e.g., ASK and ask-and-answer) or it is a more specific description than its
superior (e.g., "find out some information" and ask-and-answer). Surface semantic
representations of utterances do not appear as separate nodes on the tree, but they are
instead associated with the speech act node. The association between surface semantic
representation and event node corresponds to the relationship between ISA and speech act.
Note that the example in Figure 9.3 is quite a bit simpler than most. Once a dialogue gets
underway there are typically many levels of subcalls on the Event Tree branches.
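
A minimal sketch of such a tree follows. It assumes a node type, `EventNode`, invented for this illustration: children are kept in chronological order, a `completed` flag allows uncompleted events to be found for resumption, and the surface form of an utterance is stored on its speech act node rather than as a separate node. The actual OWL-I structures are not reproduced here.

```python
from dataclasses import dataclass, field
from typing import Iterator, List, Optional


@dataclass
class EventNode:
    description: str
    agent: Optional[str] = None
    completed: bool = False
    surface: Optional[str] = None     # utterance tied to a speech act node
    children: List["EventNode"] = field(default_factory=list)

    def add(self, child: "EventNode") -> "EventNode":
        """Append `child` as the most recent substep and return it."""
        self.children.append(child)
        return child

    def walk(self) -> Iterator["EventNode"]:
        """Yield this node and all descendants in chronological order."""
        yield self
        for child in self.children:
            yield from child.walk()


def uncompleted(root: EventNode) -> List[str]:
    """Descriptions of events still open, e.g. candidates for resumption."""
    return [n.description for n in root.walk() if not n.completed]


session = EventNode("participate in a console session")
info = session.add(EventNode("get some information"))
exchange = info.add(EventNode("ask-and-answer a question"))
exchange.add(EventNode("ASK a question", agent="P2", completed=True,
                       surface="What time is it?"))
```

Because nothing is ever removed, the same `walk` traversal serves inspection and question answering about past events as well as resumption.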

Two of the speech acts in Figure 9.3 have not been introduced previously. The first,
STATE, is a representative that conveys what P1 believes to be a fact. Quite a bit must be
said in order to adequately define the notion of a "fact" but I appeal instead to the reader's
intuition. The other new speech act, ACKNOWLEDGE, is also a representative that
conveys information about P1's status with respect to the dialogue. It is a sort of marker,
indicating that P1 has completed a step in the dialogue and is ready to continue.


9.2. The Event Tree and Recognition Mode 

This section contains a very condensed outline of the recognition process that I am
advocating, which I have characterized as a "path-building process." Although the
information content of this section is dense, everything relates to a single major point. This
point is that path building is a process that occurs in an environment of incomplete
knowledge, so that it is computationally important in what direction path building is
53. There is another way in which the treatment described is a simplification. In many
dialogue situations, utterances occur whose motivations are learned only later. These
motivations could have an effect on the choice of some of the higher level nodes on the
Event Tree path of the original utterance. (The level of concern here is the "get some
information" node in Figure 9.3.) The usual options are available to the system builder:
attaching a path to the tree can be deferred until further information is available, the
system can guess and back up, or minimal commitments can be made (e.g., "get some
information" can be chosen temporarily as a motivation). If the last alternative is chosen,
the descriptions on the event path can be refined as more information comes in. For the
dialogue modelling task, this third alternative is probably the most suitable in general. This
approach avoids problems of non-uniform representation that occur when attachments are
delayed, and it avoids the need to undo premature and erroneous commitments that may
occur with a guess and back up strategy.


participate in a console session
    agent: P1 (the computer system)
    co-agent: P2 (the user)

    get some information

        ask-and-answer a question

            ASK a question
                agent: P2
                destination: P1
                "What time is it?"

            decide whether to answer
                agent: P1

            find the answer
                agent: P1

            state-and-acknowledge the answer
                agent: P1
                co-agent: P2

                STATE the answer
                    agent: P1
                    "10:05 A.M."

                ACKNOWLEDGE the answer
                    agent: P2
                    "Thanks."


Figure 9.3. An Event Tree configuration with paths for a question/answer exchange.




attempted. I outline here a suggested approach based on a distinction between utterance
types.

Recall that methods are being used to model a task-oriented dialogue from the point
of view of one of the participants (say, P1) so that some method steps correspond to all or
part of the underlying form of generated utterances and some provide underlying forms for
utterances that are recognized. Once P1 uses a method step to generate an utterance we can
identify a set of patterns that are highly likely as underlying forms of the response by the
other participant P2, given that the dialogue continues along the same lines. This set of
patterns includes the possible active successor steps to the generated step, the lead-ins (i.e.,
first step executed in recognition mode) to recovery paths related to the generated step, or
lead-ins to recovery paths related to its successor steps. (Recovery paths were introduced in
Section 4.) Here, active is used to rule out strictly conceptual steps such as thinking of an
answer. Since P2 need not respond only to P1's goals, I include two other types of pattern,
initiator and metadiscussion. Initiators are utterances that start off new, independent tasks
(as opposed to substeps of tasks currently underway). Metadiscussion utterances are about
the dialogue rather than strictly step-derived. They tend to clarify or change the flow of
the dialogue. (For more on these different types of utterances, see Brown [4].) The different
patterns corresponding to the different types of utterances will be called structural
expectations.54 Structural expectations developed from the current state of the
Event Tree can be viewed as "on deck", waiting to participate in the matching process.
When a match against a structural expectation is successful, a path of dialogue events is
created, linking nodes for utterances to expectations. This path is attached as a branch on
the Event Tree (of this, more below).


54. The term structure is used here for what others might call the syntax of dialogue. Many
"structural" phenomena are semantic in flavor (although they do not necessarily vary
according to the specific semantic domain), and the use of the term syntax might be
misleading. For example, the fact that questions get answers can be called "dialogue
syntactic"; however, the fact that answers may have associated stipulations ("Yes, if...") or
qualifications ("Yes, but...") begins to stretch the term syntactic. The expectations generated
from methods and the general task environment are therefore referred to as structural.

Figure 9.4 illustrates the notion of a structural expectation, again with the simple
example of a question asked in a console session environment. This time, however, it is the
system that asks the question. If all proceeds according to the standard path of the
ask-and-answer method, the user will first find the answer (an assumed step from the point of
view of the system). As the ask-and-answer method is written now, the next step is a call to
state-and-acknowledge the answer. Thus, state-and-acknowledge is one expectation, although the
Interpreter must go through a layer of calls and method searches to find the speech act
expectation STATE. It is the STATE step that is at the same level of aggregation as the
utterance.55 Since the user may decide not to answer the question, other structural
expectations are possible, among them lead-ins to recovery paths associated with
ask-and-answer, lead-ins to general recovery methods, initiators, and metadiscussion. Figure 9.4
shows one such possibility, a STATE that is the lead-in step to the recovery path that says
that P2 does not know the answer. (This STATE is not represented as a substep of
state-and-acknowledge, because the expected responses are different.)


55. Additional mechanism, however, is needed to bridge the gap between the speech act (e.g., 
STATE) and the utterance. The sections that follow consider the mechanism necessary to 
bridge this gap for ISAs. 




EVENT TREE:                               STRUCTURAL EXPECTATIONS:

participate in a console session          state-and-acknowledge the answer
  agent: P1 (the computer system)           agent: P2
  co-agent: P2 (the user)                   co-agent: P1

  get some information                      STATE the answer
                                              agent: P2
                                              destination: P1

    ask-and-answer a question

      <POSSIBLE CONTINUATION POINT>       STATE that P2 doesn't know the answer
                                            agent: P2
                                            destination: P1

      ASK a question
        agent: P1
        destination: P2
          |
        “What time is it?”


Figure 9.4. An Event Tree configuration with a structural expectation 




Note that although it was straightforward in the example to get from “state-and-acknowledge 
the answer” to the speech act expectation STATE, in general there may be a 
number of intermediate nodes between an expectation from the standard path of a method 
and the possible speech act step(s). Each node on the Event Tree can be a choice point, 
where one of several methods is chosen to match the call. Since the [illegible] of the agent can 
influence the choice of method, recognition of an utterance, i.e., constructing the appropriate 
Event Tree path, is generally done in an environment of incomplete knowledge. It is not 
immediately clear, therefore, whether a path should be built top-down (from most aggregate 
expectation to speech act expectation), bottom-up, or using some hybrid approach. 

The answer to this problem suggested in [3] is to use different path-building 
strategies for different basic utterance types. Recognition procedures for initiators, 
metadiscussion, and the more general variety of recovery discussion, which tend, particularly 
in task-oriented dialogue, to be only weakly determined by context, can depend heavily on 
the utterance itself. In terms of the Event Tree, this means that paths are built essentially 
bottom-up. Recognition procedures for standard path successor steps and recovery 
discussion that is closely tied to the structure of the dialogue can be driven by the most 
aggregate structural expectations (evaluated with respect to the current environment), since 
in this case the expectations will tend to embody the better information. For these types of 
utterance, Event Tree path construction can proceed in a generally top-down direction. 
Using such a mixed strategy, the recognition process can take advantage of the context 
supplied by the method representation and use different, more utterance-centered strategies 
when strong contextual information is not available. 
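
The mixed strategy can be pictured as a simple dispatch on basic utterance type. The following is a minimal sketch, not the mechanism of [3]: the enumeration names and the function `path_direction` are hypothetical labels introduced here for illustration.

```python
from enum import Enum


class UtteranceType(Enum):
    """Hypothetical labels for the basic utterance types discussed above."""
    STANDARD_PATH_SUCCESSOR = "standard path successor step"
    TIED_RECOVERY = "recovery discussion closely tied to dialogue structure"
    GENERAL_RECOVERY = "general recovery discussion"
    INITIATOR = "initiator"
    METADISCUSSION = "metadiscussion"


# Assumption: these are the types whose structural expectations carry
# the better information, so recognition can be driven by them.
TOP_DOWN_TYPES = {
    UtteranceType.STANDARD_PATH_SUCCESSOR,
    UtteranceType.TIED_RECOVERY,
}


def path_direction(utterance_type):
    """Return the direction in which an Event Tree path would be built."""
    return "top-down" if utterance_type in TOP_DOWN_TYPES else "bottom-up"
```

In a real recognizer the utterance type is not known in advance, so a dispatch of this kind would be tried per hypothesis rather than once per utterance.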


10. General Observations about ISA Recognition 


Given the general process model for recognition outlined in the last section, I can now 
discuss issues related to ISA recognition. In the first two subsections I consider the 
implications of the recognition framework for ISA processing. The final two subsections 
discuss the problem of ambiguity for ISAs. 


10.1. The Role of the Recognition Framework 

The structural expectations and the basic utterance types introduced in Section 9 can 
play a role in the recognition of ISAs. Within this framework, most of the examples used so 
far can be thought of as initiators. For example, “Can you pass the salt?” can initiate an 
independent task associated with eating a meal. Not all ISAs, however, occur as initiators. 
Consider, for example, the process of paying for merchandise with a credit card. If, some 
way along in the process, the salesperson utters 10.1, this is typically a directive, not 
merely a representative. 

10.1 I need your signature. 


I claim that a good way for a process model to determine the correct speech act is to use the 
fact that the pay-with-credit-card method will contain a step to the effect that the 
salesperson REQUESTs the customer to sign the credit slip. Given this structural 
expectation, the recognition process knows that it is looking for a REQUEST force (possibly 
among other alternatives). It can then ask whether the incoming utterance can be construed as a 
REQUEST. More precisely, to recognize an utterance as a standard path or recovery path 
successor step or as a lead-in to a recovery path, the Interpreter can do what is necessary to 
find the speech act(s) (i.e., by hypothesizing possible Event Tree paths top-down from 
evaluated method steps), find the ISA forms associated with this speech act (whether stored 
explicitly or in the form of general rules), and ask whether the utterance matches any of 
these forms.56 
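
The three steps just listed (find the expected speech acts, find their ISA forms, test the utterance against them) can be sketched as follows. This is an illustrative toy under stated assumptions: the dictionary-based utterance representation, the predicate form of the ISA forms, and the name `recognize_top_down` are all hypothetical and stand in for the surface semantic and transitional representations of the paper.

```python
def recognize_top_down(utterance_repr, expected_acts, isa_forms):
    """Return the expected speech acts whose ISA forms match the utterance.

    expected_acts: speech acts hypothesized top-down from method steps.
    isa_forms: dict mapping a speech act name to a list of predicates,
               each standing in for one ISA form (stored or rule-derived).
    """
    matches = []
    for act in expected_acts:
        if any(form(utterance_repr) for form in isa_forms.get(act, [])):
            matches.append(act)
    return matches


# Hypothetical stand-in for the representation of "I need your signature."
utterance = {"mood": "declarative", "verb": "need",
             "agent": "P2", "object": "signature of P1"}

# Toy ISA forms: a stated need counts as one REQUEST form (an assumption).
isa_forms = {
    "REQUEST": [lambda u: u["mood"] == "declarative" and u["verb"] == "need"],
    "ASK": [lambda u: u["mood"] == "interrogative"],
}

# The pay-with-credit-card method supplies REQUEST as the expectation.
result = recognize_top_down(utterance, ["REQUEST"], isa_forms)
```

Because only the expected speech acts are consulted, forms belonging to unexpected acts are never tried, which is the saving that strong context provides.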

In contrast to standard and recovery paths, context is weaker for initiators, 
general recovery discussion, and metadiscussion.57 In these cases, I have proposed that the 
Event Tree path(s) be grown bottom-up. For ISAs this means starting with an ISA form 
and asking what speech acts it could convey. 

In thinking about ISA representations and the details of recognition, then, I will be 
considering both directions: ISA form to speech act (henceforth abbreviated as ISA -> SA) 
and speech act to ISA form (abbreviated as SA -> ISA). This bi-directional approach 
follows from the choices made in the general recognition framework. Since both directions are 
covered, it also means that the discussion in the following sections should be applicable to 
dialogue process models that are similar to the one proposed but which use a different mix 
of bottom-up and top-down strategies to relate utterances to expectations. 


56. Here and elsewhere, when I talk in terms of utterances matching ISA forms, I really 
mean a match between the surface semantic representation of the utterance and the 
transitional representation of the ISA form (see the appendices for a discussion of these 
representations). 


57. It is not, however, clear to what extent metadiscussion uses ISAs, since these utterances 
tend to fall into rather fixed patterns. 




10.2. ISAs and the Event Tree 

In order to represent ISAs, one Event Tree path is built for each speech act conveyed 
by an utterance.58 Given this approach, example 5.19 (“It’s cold in here” uttered as a 
REQUEST to close the door) would have the Event Tree informally represented in Figure 
10.1. 


          <header of the highest level method 
           for this task-oriented dialogue> 


give information                   get some task done 

state-and-acknowledge              request-and-respond for 
the information                    the task 

STATE that                         REQUEST that 
the room is cold                   P1 close the door 


          surface semantic representation of 
          “It’s cold in here.” 


Figure 10.1. A possible Event Tree configuration for Example 5.19. 


58. An utterance may, of course, suggest more than one possible set of Event Tree paths, 
and the choice between them may not be obvious. This is a problem of ambiguity, which 
will be discussed in the next two subsections. 




Recall that the surface semantic representation of the utterance is associated with speech act 
nodes but is not itself an actual node on the Event Tree. 

Despite the multiple path representation given to some utterances, ISAs in a 
task-oriented dialogue are typically motivated by a single primary hierarchy of goals and 
subgoals held by P1. That is, in the usual case, one of the speech acts conveyed reflects the 
primary motivation behind the utterance. Moreover, the speech act force that is the most 
important will be the final one, i.e., secondary for utterances with immediate and secondary 
forces, tertiary for a three-force utterance. For the “It’s cold in here” example the primary 
goal hierarchy would include getting the room warmer by getting the door shut, and thus 
REQUEST can be thought of as the more important of the two speech acts conveyed. 

The observations in the last paragraph have two implications for the general 
recognition mechanism that has been outlined. First, in those cases where I advocated 
top-down processing (Section 9.2), if an utterance by P1 conveys multiple forces, only one of the 
forces will match a structural expectation related to the current configuration of the Event 
Tree. This force can be thought of as conveying the primary goal of P1. Second, the force 
that matches the expectation will be the final one.59 Figure 10.2 shows the type of Event 
Tree configuration that I am assuming. Figures 10.3 and 10.4 illustrate cases that I am 
assuming to be non-standard in task-oriented dialogue, because they contradict the first and 
second assumptions, respectively. 


59. I emphasize that this restriction applies to the usual case. Exceptions are, of course, 
possible, but I would expect to see additional mechanism called in to handle them. 




buy merchandise with credit card 
agent: PI (the buyer) 
co-agent: P2 (the salesclerk) 


<intervening substeps> 


fill out credit slip 
agent: P2 
co-agent: PI 


request-and-respond 
for PI to sign the credit slip 


STATE that                      REQUEST that 
P2 needs P1 to sign             P1 sign the credit slip 
the credit slip 


surface semantic representation of 
“I need your signature.” 


Figure 10.2. Event Tree paths for a two-force ISA. 




In Figure 10.2, note that the STATE and REQUEST nodes would be marked with 
the same time and surface semantic representation, so there would be no chance of 
construing them as totally independent events. 


SPEECH ACT 1                SPEECH ACT 2 


IMMEDIATE FORCE             SECONDARY FORCE 


Figure 10.3. A questionable Event Tree configuration for an ISA 


Figure 10.3 is only skeletal, because I have not found examples of utterances that relate 
to two independent tasks (or independent parts of the same task) by a split in their speech 
act forces. 

Finally, Figure 10.4 says that one can answer a simple question about the temperature 
of the room with an utterance that is secondarily an indirect REQUEST. I claim that 
Figure 10.4 is an ill-formed Event Tree, if it is taken to represent one of the ISA types 
discussed in this paper. Note that the assumption that only the final force can match a 
structural expectation related to the Event Tree implies that, once such a match is found, 
there is no need to derive further indirect forces. 

In summary, Event Tree ISA representations involve one path per speech act force. 
For top-down recognition processing, however, only one path matches a structural 




ask-and-answer 


ASK 
what the temperature of the room is 
on a scale of cold, medium, and hot. 
agent: P2 
destination: PI 


find the answer 
agent: PI 


state-and-acknowledge the answer 
agent: PI 
co-agent: P2 


STATE that REQUEST that 
the room is cold PI close the door 


surface semantic representation of 
"It’s cold in here.” 


Figure 10.4. Another questionable Event Tree configuration for an ISA. 




expectation related to the Event Tree -- the path generated for the final speech act force. 
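
The two assumptions summarized here (one path per force; only the final force matches a structural expectation) can be stated compactly. The sketch below is illustrative only: the `Utterance` class, the string labels for paths, and both function names are hypothetical, and the forces are simply listed in order from immediate to final.

```python
from dataclasses import dataclass


@dataclass
class Utterance:
    text: str
    forces: list  # ordered: immediate, secondary, (tertiary)


def paths_for(utterance):
    """One Event Tree path (here just a label) per conveyed force."""
    return [f"path for {force}" for force in utterance.forces]


def matching_force(utterance, structural_expectations):
    """Return the final force if it matches an expectation, else None.

    Earlier forces are deliberately not tested: under the assumption in
    the text, only the final force relates to the current Event Tree.
    """
    final = utterance.forces[-1]
    return final if final in structural_expectations else None


u = Utterance("I need your signature.", ["STATE", "REQUEST"])
```

Under this reading, the configuration of Figure 10.4 is rejected because the expectation there would have to be met by the immediate STATE rather than by the final REQUEST.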


10.3. Some Remarks about ISA Ambiguity 

The last two subsections glossed over the problem of ambiguity, and I would like to 
remedy that situation now. In the recognition framework described in Section 9, 
ambiguities with respect to speech act force occur when an utterance matches more than one 
structural expectation. A choice must usually be made as to which interpretation was 
intended. In this subsection, I will make some general observations on the process of 
making this choice. Three points will be discussed: 

1. the relationship of disambiguation reasoning and reasoning done for other 
   purposes 

2. the effect that a dialogue environment has on the disambiguation process 

3. systematic versus nonsystematic ambiguity between the ISAs conveyed60 


Starting with the relationship of disambiguation reasoning to other sorts of reasoning, 
the most important point is that disambiguation reasoning is reasoning about the world 
model of the conversational partner. For example, if P1 makes an utterance to P2, any 
reasoning that P2 does about the need for an action to be done, etc. is done with respect to 
what P2 believes about P1's world model. This is not necessarily the same as P2’s model of 


60. Searle in [30] has questioned the use of the term ambiguity to refer to differences in the 
forces conveyed by indirect forms. Since the notion of meaning that I am using (see Section 
7.3) is broader than Searle's, and since differences in conveyed forces amount to differences 
in meaning as I construe it, I will continue to use disambiguation to mean choices among the 
possible ISA forces of an utterance. 




the world (or for that matter as P1's; this is a common source of error). This said, I will not 
say much more about the divergence of world models. Recent work in this area is that of 
Cohen [6]. To justify limiting the scope of this paper, I appeal to what appears to be the 
default assumption: 

If there is no information to the contrary, assume that the dialogue partner's world 
model is the same as your own. 

So, although I will note places where dual world models become a factor, I will assume for 
the purposes of exposition that there is "no information to the contrary.” 

Reasoning for disambiguation, then, does not always use the same knowledge sources 
as general reasoning. A further point about disambiguation reasoning is that it can be 
quite complex. Where REQUEST is a possibility as a conveyed speech act, it may be 
necessary for P2 to evaluate capabilities, willingness, authority relationships, schedules of 
activities, etc. This sort of evaluation can be an expensive process. Note, however, that the 
same information needed to disambiguate is often also needed to decide whether to comply 
with a directive or to determine whether information conveyed in a representative was 
previously known. Thus, a system that can save its reasoning processes -- or at least 
save those steps and results known to be of potential interest later on -- can avoid 
duplicating what appear to be inherently expensive processes. 
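
Saving results for later reuse amounts to caching the outcome of each evaluation. The following is a minimal sketch under stated assumptions: `ReasoningCache`, `toy_evaluator`, and the (question, action) keys are hypothetical, and the evaluator is a lookup table standing in for genuinely expensive reasoning about P1's world model.

```python
class ReasoningCache:
    """Remember results of expensive evaluations so they run only once."""

    def __init__(self, evaluator):
        self._evaluator = evaluator
        self._results = {}
        self.calls = 0  # number of times the expensive evaluator actually ran

    def evaluate(self, question, action):
        key = (question, action)
        if key not in self._results:
            self.calls += 1
            self._results[key] = self._evaluator(question, action)
        return self._results[key]


def toy_evaluator(question, action):
    # Stand-in for reasoning about capabilities, willingness, authority, etc.
    known = {("capable", "close the door"): True,
             ("willing", "close the door"): True}
    return known.get((question, action), False)


cache = ReasoningCache(toy_evaluator)
# First use: deciding whether a REQUEST reading is plausible.
plausible_request = cache.evaluate("capable", "close the door")
# Second use: deciding whether to comply; the saved result is reused.
can_comply = cache.evaluate("capable", "close the door")
```

A fuller treatment would also have to invalidate saved results when the dialogue context changes, which this sketch ignores.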

This brings me to my second point, which relates to the effect of the dialogue 
environment on the disambiguation process. One important aspect of dialogue is the fact 
that P1 is available to aid P2 in disambiguating P1's utterances. Dialogue participants quite 
commonly ask for clarification of intent, e.g., 10.2 to a “Do you know...?” question or 10.3 to a 
“Can you <action>?" form: 

10.2 Are you asking me or telling me? 

10.3 Yes. Do you want me to? 

Another frequently used strategy is for P2 to make a disambiguating choice clear in a 
response. An example here is 10.4 in response to a "Can you <action>?” form interpreted as 
a REQUEST: 

10.4 Sure. I'll get started right away. 

One result of this sort of response is that P1 can go on to alert P2 if P2's interpretation was 
not the one that P1 intended. 

The dialogue environment, then, provides additional opportunities for acquiring 
disambiguation information and, if responses are constructed carefully, it provides checks to 
prevent misunderstandings.61 Disambiguation in task-oriented dialogue may be difficult, 
but the options available effectively limit how hard a dialogue participant must work. 

We turn now to the third topic of this subsection, the distinction between systematic 
and nonsystematic ambiguity in the speech acts conveyed by an utterance. A systematic 
ambiguity for ISAs occurs when all forms within an ISA subset display a particular 
ambiguity with respect to the speech act conveyed. Systematic ambiguities for ISAs include 
ambiguities that occur between direct and indirect immediate forces of an utterance. This 


type of ambiguity is common to all ISAs that do not contain special identifying 


61. For a systematic approach to differences among different linguistic modalities, see Rubin 
[23]. 




characteristics (e.g., some frozen ISA forms as described in Section 7.4). A familiar example 
is whether 5.3 has ASK or REQUEST as its immediate force. 


5.3 Can you close the door? 


Another type of systematic ISA ambiguity is whether or not a given secondary force applies. 
This type of ambiguity is common to all ISA forms which may convey a secondary force, 
and the familiar example is whether 5.4 conveys a REQUEST in addition to the immediate 
ASK force. 


5.4 Are you able to close the door? 


Given what has been said, nonsystematic ambiguities are ambiguities that are not 
shared by other similar forms. The occurrence of nonsystematic ambiguity is more 
fortuitous. Such an ambiguity typically arises from a special configuration of structural 
expectations, e.g., for standard path successor steps. Either because of lexical ambiguity 
within the utterance or because the expectations are similar to each other, an utterance 
matches more than one expectation. 

In discussing aspects of the process model, I will be concerned only with systematic 
ISA ambiguity. For this type of ambiguity, the structural expectations that match the 
utterance are all of the same basic type, that is, the utterance has several possible 
interpretations as an initiator, or several as a standard path successor step, etc. Ambiguities 
in which possible structural expectations are drawn from different basic types are 
nonsystematic. 
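
The type-based part of this distinction lends itself to a simple check. The sketch below is a hedged illustration, with the function name and the (basic type, speech act) pairs as hypothetical conventions. Note that it tests only a necessary condition: expectations of a single basic type may still be nonsystematically ambiguous, so the function reports "possibly systematic" rather than "systematic".

```python
def ambiguity_kind(matched_expectations):
    """Classify an ambiguity from the expectations an utterance matched.

    matched_expectations: list of (basic_type, speech_act) pairs.
    """
    if len(matched_expectations) < 2:
        return "unambiguous"
    basic_types = {basic_type for basic_type, _ in matched_expectations}
    # Mixed basic types rule out systematic ambiguity outright.
    if len(basic_types) > 1:
        return "nonsystematic"
    return "possibly systematic"
```

For instance, an utterance matching both an ASK and a REQUEST expectation as an initiator would be reported as possibly systematic, while one matching an initiator and a standard path successor step would be nonsystematic.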


Because systematic ISA ambiguities are predictable, it is possible to exploit special 




knowledge about them and tailor special disambiguation mechanisms to them. For 
nonsystematic ambiguity, it appears that the best tools we have available are general 
heuristics and a general disambiguation mechanism.62 The mechanism is “general” in the 
sense that it is not keyed to particular ISA ambiguities. In this paper, I restrict myself to 
systematic ISA ambiguity; the next section discusses the application of special knowledge 
associated with ISA types. Before that, however, it is useful to take a general look at the 


dependencies that must be taken into account in the disambiguation process. 


10.4. Dependencies in ISA Disambiguation 

A computational approach to ISA disambiguation has several inherent problems. For 
those utterances with only an immediate speech act force, the recognition process will not 
necessarily be simpler than for utterances with more than one speech act force. For example, 
“Are you able to open the door?” could have the immediate force ASK only, with no 
secondary force. For this type of utterance, it will often be necessary to consider the 
possibility of a secondary force in order to rule it out. 

Also, for utterances where an action could be implicit, it is often necessary to 
determine whether P1 intends an action (and what that action is) as part of the process of 
determining the immediate or secondary force. For example, 10.5 could have the immediate 
force STATE, or, say if P1 were blackmailing P2, it could have an immediate REQUEST 
force. 

10.5 I want a million dollars. 


62. In [3] I discuss the use of ordering of match attempts as one part of a mechanism for 
handling nonsystematic ambiguity in task-oriented dialogue. 




It is necessary to determine the likely implicit action (in this example, that P2 give PI the 
money) in order to decide whether this particular action can be REQUESTed in this 
environment. 

On the other hand, for most (and possibly all) ISAs that do have a secondary force, 
determining the immediate force can be done independently of the secondary force. This is 
because, as noted in Section 7.3, most ISA classes considered here are derived by only one 
level of indirection.63 We thus do not have the case illustrated in Figure 10.5, where a 
surface form leads to two different immediate forces with two different associated secondary 
forces, the latter of which must be considered in determining which of the former apply. 
Instead the case that obtains is Figure 10.6. The computational implication of 10.5 and 10.6 
is that immediate forces can be determined from the utterance and from context, 
independent of any consideration of the nature of the secondary force that might apply. In 
the process model, we can exploit this independence of choice of immediate force where it 
exists, with the normal gains in simplicity and [illegible] that come from decoupling 
processes. 


63. Among the ISA forms that I am considering, the only possible exception to this is the 
type 3 implicit-action form. Even in this case, however, we would have to have a very 
specialized type of form in order for determination of immediate force to be dependent on 
secondary force. We would have to be dealing with what I am calling a frozen ISA form 
(see Section 7.4). The immediate force would not correspond directly to surface form, and 
there would only be a secondary, not a tertiary, speech act force. Although all of this is 
theoretically possible, I have not found any examples in the transcripts that I have 
examined. 




            Immediate Force-1 + Secondary Force-1 
Utterance < 
            Immediate Force-2 + Secondary Force-2 


Figure 10.5. A type of ambiguity in speech act force that probably 
does not occur. 


            Immediate Force-1 + Secondary Force-1 
Utterance < Immediate Force-1 + Secondary Force-2 
            Immediate Force-1 


e.g., 


Utterance = “The garden is full of weeds.” 
Immediate Force-1 = a STATE that the garden is full of weeds 
Secondary Force-1 = a REQUEST that P2 weed the garden 
    (e.g., in a context where this is P2's job) 
Secondary Force-2 = a SUGGEST that P2 weed the garden 
    (e.g., in a context where P2 is deciding what to do) 


Figure 10.6. The common type of ambiguity in speech act force. 
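
The independence shown in Figure 10.6 means the two determinations can be written as separate steps, with the first never consulting the second. The sketch below is a toy under stated assumptions: the punctuation-based `immediate_force`, the context strings, and the lookup table are hypothetical simplifications of the processes described in the text.

```python
def immediate_force(utterance):
    """Toy rule: interrogatives ASK, everything else STATEs."""
    return "ASK" if utterance.rstrip().endswith("?") else "STATE"


# Hypothetical contexts keyed to the secondary forces of Figure 10.6.
SECONDARY_BY_CONTEXT = {
    "weeding is P2's job": "REQUEST",
    "P2 is deciding what to do": "SUGGEST",
}


def interpret(utterance, context):
    """Determine forces in two decoupled steps."""
    first = immediate_force(utterance)           # never looks at secondary force
    second = SECONDARY_BY_CONTEXT.get(context)   # None if no secondary force
    return (first, second)
```

In the configuration of Figure 10.5, by contrast, `immediate_force` would need the result of the secondary-force step as an input, and the two could not be separated in this way.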




11. Outline of a Mechanism for ISA Recognition 


Given the framework for ISA recognition discussed in Sections 10.1 and 10.2, along 
with the observations about ISA disambiguation from 10.3 and 10.4, we can now examine 
the mechanism for ISA recognition in more detail. This section starts with a discussion of 
the application of the rules from Section 6. Section 11.2 considers a mechanism for 
recognizing frozen ISA forms, and Section 11.3 looks at the recognition of implicit-action 
ISAs. 


11.1. Applying the Rules for Explicit-Action Forms 
Recall that the ISA rules in Section 6 are based on properties of speech acts as a 
class, preconditions of individual speech acts, and (for some action-centered speech acts) the 
semantic input cases of the actions. In this subsection, I outline a mechanism for 
recognizing rule-based ISAs, splitting the discussion into the two directions identified in 
Section 10, SA -> ISA and ISA -> SA. 

SA -> ISA is the easier direction to describe for these ISA rules. We start with the set 
of structural expectations that are to be matched against (Section 9.2). (Whether the 
match process should proceed sequentially or in parallel is an open question that will not be 
considered here.) Focusing on one speech act from this set of expectations, we can obtain 
the preconditions of the individual speech act by following the links in the method structure 
described in Section 4. It is also necessary to get from an action named in an 
utterance to an associated set of semantic input case specifications (or, in case of ambiguity, a 
group of such sets). Given these semantic structures, we can apply ISA rules 6.3 to 6.10 to 
yield patterns particular to the speech act. (Alternatively, these ISA patterns can be 
pre-generated and associated with the speech act.) The ISA patterns associated with the speech 
act, along with patterns from Rules 6.1 and 6.2 which are applicable to speech acts in 
general, can then be evaluated with respect to the current dialogue environment. One of 
the major effects of the evaluation process is to bind representations for the appropriate 
dialogue participants to semantic case variables in the pattern. The surface semantic 
representation of the utterance can then be matched against each of these evaluated 
patterns. 
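
The SA -> ISA steps above (obtain preconditions, generate patterns, bind participants, match) can be sketched in miniature. Everything here is an assumption made for illustration: the two REQUEST preconditions, the single generation rule (state a speaker-based precondition, question a hearer-based one), and the dictionary patterns are toy stand-ins and are not Rules 6.1 to 6.10 of this paper.

```python
# Toy preconditions; "{action}" is a semantic case variable to be bound.
PRECONDITIONS = {
    "REQUEST": [("speaker", "wants", "{action}"),
                ("hearer", "is able to", "{action}")],
}


def generate_patterns(act):
    """Apply one toy ISA rule to each precondition of the speech act."""
    patterns = []
    for role, relation, obj in PRECONDITIONS[act]:
        mood = "declarative" if role == "speaker" else "interrogative"
        patterns.append({"mood": mood, "subject": role,
                         "relation": relation, "object": obj})
    return patterns


def evaluate(pattern, bindings):
    """Bind dialogue participants and the action into a pattern."""
    bound = dict(pattern)
    bound["subject"] = bindings[pattern["subject"]]
    bound["object"] = pattern["object"].format(action=bindings["action"])
    return bound


def matches_expectation(utterance_repr, act, bindings):
    """True if the utterance equals some evaluated pattern for the act."""
    return any(evaluate(p, bindings) == utterance_repr
               for p in generate_patterns(act))


bindings = {"speaker": "P1", "hearer": "P2", "action": "close the door"}
# Hypothetical representation of "Can you close the door?"
utterance = {"mood": "interrogative", "subject": "P2",
             "relation": "is able to", "object": "close the door"}
```

Exact dictionary equality is the crudest possible matcher; it is used here only to keep the generate, evaluate, and match stages visibly separate.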

In the SA -> ISA direction we encounter predominantly nonsystematic ambiguities 
among speech act forces. Therefore, the type of general disambiguation mechanism chosen 
determines how the successful matches are treated (or whether match attempts continue at 
all beyond an initial success). 

The ISA -> SA direction for the general rules is more complicated. When the 
recognition mechanism is working top-down, the method structure and associated 
relationships conveniently group the representations on which the ISA patterns are based. 
Working bottom-up, we must contend with the systematic ambiguities discussed in the last 
section. This means that for almost every utterance we will have choices between speech 
act forces, choices whether a given speech act force applies, or both. This introduces 
several problems. 


First, at least for a serial matching mechanism, in the worst case every different 
speech act might have to be checked to see if the utterance in hand matches any of the 
possible indirect forms for that speech act. Also, independent of the matching 
implementation chosen, deciding which speech act forces were intended by P1 can involve 
very complex analysis, particularly if action-centered speech acts are among the possible 
forces. 

To cope with these difficulties, we can exploit two sources of information: knowledge 
about which speech act forms can be used to stand for others as ISAs and predictions of the 
type of knowledge that will be necessary to distinguish between competing speech act forces. 

To use this knowledge, I propose a two-pass mechanism consisting of initial screening 
followed by analysis. Initial screening involves a match of an utterance against special entry 
patterns. These patterns are abstracted, if necessary, from patterns produced by the 
application of ISA rules such as those in Section 6 to appropriate semantic structures (e.g., 
preconditions). The point of the abstracting process is to produce entry patterns for which 
the match will involve only context-independent properties. This means that a match of an 
entry pattern involves only information explicit in the utterance or implicit information that 
can be derived independent of context. The authority relationship between P1 and P2, for 
example, is a property that would not be appealed to as part of the initial match.64 

The abstracted entry patterns have associated with them analysis procedures to 
discriminate between alternative interpretations. Once an entry pattern is matched (due to 
the abstraction process there will be only one success), the associated discrimination 
procedure is executed. Discrimination procedures examine properties of the dialogue 


64. The abstraction process produces patterns that are as specific as possible while still 
permitting a context-free match. Abstraction is done on the variables of the ISA patterns 
only (see Appendix B). 


context to determine which speech act forces were intended by P1. These procedures are 
written to incorporate the specialized knowledge that we have about individual systematic 
ambiguities. 
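
The two-pass organization can be sketched as a list of (screen, discriminate) pairs, where only the second member of each pair is allowed to look at context. This is a minimal illustration, not the mechanism itself: the function names, the "Can you <action>?" entry pattern, and the single context flag are hypothetical.

```python
def two_pass_recognize(utterance_repr, context, entries):
    """Run initial screening, then the matched entry's discrimination procedure.

    entries: list of (screen, discriminate) pairs. screen sees only the
    utterance; discriminate sees the utterance and the dialogue context.
    """
    for screen, discriminate in entries:
        if screen(utterance_repr):                          # pass 1: context-free
            return discriminate(utterance_repr, context)    # pass 2: contextual
    return None


def screen_can_you(u):
    # Toy entry pattern for "Can you <action>?" forms.
    return u.get("mood") == "interrogative" and u.get("frame") == "P2 able to <action>"


def discriminate_can_you(u, context):
    # Toy test: an obligation of P2 to P1 favors the REQUEST reading.
    return ["REQUEST"] if context.get("P2_obliged_to_P1") else ["ASK"]


ENTRIES = [(screen_can_you, discriminate_can_you)]
utterance = {"mood": "interrogative", "frame": "P2 able to <action>",
             "action": "close the door"}
```

Stopping at the first successful screen mirrors the remark above that, because of abstraction, only one entry pattern will succeed.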
For an example of the type of processing I have in mind, consider the following uses 
of a “Do you want...” form. (Other uses occur, but these four will do for an illustration.) 


ASK + OFFER 
one friend to another: 
Do you want to take my car? 


ASK + REQUEST 
one roommate to another: 
Do you want to answer the phone? 


ASK + SUGGEST 
parent to child: 
Do you want to try to build an arch? 


ASK 
one student to another: 
Do you want to go back to school? 


Any of these examples pass an initial match of a representation informally conveyed by:65 


interrogative: whether P2 wants nonverbal action 
               or verbal action whose destination is not P1 
except for:    1. actions involving joint agency of P1 and P2 
               2. the action of remembering a previous REQUEST 


65. This pattern illustrates the abstraction process. Since abstraction only happens for 
variables, only the representation for the action desired is affected. For OFFER, this action 
is “some action that complements P1's part of the action” (OFFER IV). For REQUEST, the 
action is “a non-verbal action or a verbal action whose destination is not P1, except for the 
action of remembering a previous REQUEST.” For SUGGEST, actions with joint agency 
of P1 and P2 are excluded, while for ASK there is no restriction. To produce the entry 
pattern, the restriction on the action in OFFER is ignored, because context dependent 
information might be required to verify it. This leaves SUGGEST and REQUEST with 
restricted actions. An entry pattern is then formed using both of these restrictions. 




Once any of the four types of utterance above has passed the initial screening, we need a 
procedure to determine the correct speech act forces. To discriminate between the various 
speech act possibilities, I suggest the tests listed below, applied as shown in Figure 11.1.66 

Tests for REQUEST: 

1. Does the action named fall within some authority, complementary role, or “be helpful” 
obligation of P2 to P1? (Recall the discussion of these obligations in Section 3.) 

2. Would one generally expect that P2 could successfully take responsibility for the action 
named? 

The ASK Block: 

1. Is there a reason why P1 would want to [illegible] on the literal interpretation of the utterance 
without necessarily wanting to set in motion the action named? 

2. Does the context preclude setting the action named in motion? This would be the case if 
the context is not a normal one for the action and there is no reason to assume that the 
standard conditions have been overridden. 

Tests for OFFER: 

1. Would one generally expect the primary benefit of the action named to be to P2 in this 
context? 

2. Is P1 to be an active participant (i.e., AGENT or CO-AGENT) in the action? 

3. Does it look like P1 can successfully take responsibility for an action that complements the 
action named (e.g., “lend a car” for “take a car")? One issue involved here is whether any 
objects of a transfer fall under P1's ownership, control, etc. 


66. Recall from Section 10.3 that these tests are to be done with respect to the world model 
that P2 believes that P1 is using. 


Is P1 the AGENT of the action named? 

NO          YES 

Does the utterance pass the tests for OFFER? 

YES          NO 

Does the utterance pass the ASK block? 

YES          NO 

ASK + OFFER 

Does the utterance pass the tests for REQUEST? 

NO          YES 

Does the utterance pass the ASK block? 

YES          NO 
ASK + REQUEST 

Does the utterance pass the tests for SUGGEST? 

YES          NO 

Does the utterance pass the ASK block? 

YES          NO 
ASK + SUGGEST          ASK 


Figure 11.1. General organization for discriminating between four 
interpretations of the form “Do you want <action>?” 


Tests for SUGGEST: 


1. For each equal authority relationship between P1 and P2, can the action named be 
construed as one within the relationship? 


2. Is P1 not to be an active participant in the action named? 


3. Would one generally expect the primary benefit of the action named to be to P2 in this 
context? 


4. Would one generally expect that P2 could successfully take responsibility for the action 
named? 
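
As a rough illustration, the tests listed above and the organization of the accompanying figure 
might be sketched as a small decision procedure. The sketch is in Python. The names (Answers, 
discriminate) and the exact branch order are assumptions made for illustration; they are one 
reading of the figure, not a specification. Each group of tests is reduced to a boolean that 
the caller is assumed to have computed already. 

```python
from dataclasses import dataclass
from typing import Tuple


@dataclass(frozen=True)
class Answers:
    """P2's answers for one "Do you want <action>?" utterance (names are hypothetical)."""
    p1_is_agent: bool       # Is P1 the AGENT of the action named?
    passes_offer: bool      # every OFFER test is answered yes
    passes_request: bool    # every REQUEST test is answered yes
    passes_suggest: bool    # every SUGGEST test is answered yes
    passes_ask_block: bool  # some ASK-block question is answered yes


def discriminate(a: Answers) -> Tuple[str, ...]:
    """Return the forces conveyed, immediate force first. ASK is always immediate."""
    # Assumed branch order: OFFER is tried when P1 is the agent,
    # otherwise REQUEST and then SUGGEST.
    if a.p1_is_agent:
        candidates = [("OFFER", a.passes_offer)]
    else:
        candidates = [("REQUEST", a.passes_request), ("SUGGEST", a.passes_suggest)]
    for force, passed in candidates:
        if passed:
            # A yes from the ASK block keeps the utterance a literal question.
            return ("ASK",) if a.passes_ask_block else ("ASK", force)
    return ("ASK",)
```

For example, discriminate(Answers(False, False, True, False, False)) yields an ASK with a 
secondary REQUEST, while the same answers with the ASK block passed yield a plain ASK. 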


The tests and the organization given here are intended for a limited set of "Do you want...?" 
forms, but they are representative of such discrimination procedures. The tests given include 
checks of the speech act preconditions (see Section 3) as well as checks of semantic case 
specifications for the speech act or action named. Some of these, e.g., the third test for 
SUGGEST, may be more restricted than the comparable precondition or semantic case 
specification. The following criteria were used to decide which preconditions to exclude from 
the tests: 


1. Is this precondition shared by all of the speech act possibilities? 
Preconditions shared by all alternatives may be worth verifying, but not as part of a 
discrimination procedure. 


2. Is this precondition being questioned or stated by the ISA form? 
This criterion rules out REQUEST (Hl), SUGGEST (VJ), and OFFER (VI) for the 
"Do you want..." example. 


3. Is the speech act usually the only evidence for this precondition? 


67. This condition seems to be limited to the case in which the utterance is acting as an 
initiator (see Section 9). Utterances that convey a SUGGEST act but which occur in a 
context of stronger structural expectations seem to dispense with this condition and use the 
more general SUGGEST precondition (VH) instead. (I have suggested a basically top-down 
(SA --> ISA) treatment of this latter type of utterance.) It is not surprising to see stronger 
restrictions on the range of initiators, since the context needed for disambiguation is weaker. 




An example is the obligation in SUGGEST (IV) that the DESTINATION of the 
SUGGEST consider the action suggested. Such obligations rest on very general 
properties of cooperative activity, for which there will not usually be good particular 
evidence. Similarly, there is frequently no evidence other than the speech act for 
intentional states of another individual. Preconditions of this type may be worth 
checking as a tie-breaker, but the chances of getting useful information are small 
enough that these preconditions should probably be excluded from the routine 
checking of a discrimination procedure. 
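
The three criteria can be read as a filter over a precondition list. The following Python 
sketch shows one way to express that filter; the Precondition record and its field names are 
hypothetical, and how each flag gets its value is outside the sketch. 

```python
from dataclasses import dataclass
from typing import List


@dataclass(frozen=True)
class Precondition:
    name: str
    shared_by_all: bool             # criterion 1: shared by every speech act possibility
    questioned_or_stated: bool      # criterion 2: questioned or stated by the ISA form
    speech_act_only_evidence: bool  # criterion 3: the speech act is usually the only evidence


def routine_tests(preconditions: List[Precondition]) -> List[Precondition]:
    """Preconditions that none of the three criteria exclude."""
    return [p for p in preconditions
            if not (p.shared_by_all or p.questioned_or_stated or p.speech_act_only_evidence)]


def tie_breakers(preconditions: List[Precondition]) -> List[Precondition]:
    """Preconditions excluded by criterion 3 alone, kept aside for breaking ties."""
    return [p for p in preconditions
            if p.speech_act_only_evidence
            and not (p.shared_by_all or p.questioned_or_stated)]
```

Keeping the criterion 3 preconditions in a separate list reflects the remark that they may 
still be worth checking as a tie-breaker even though they are left out of routine checking. 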


One general point about the discrimination procedure relates to the independence 
issues discussed in Section 10.4. Recall the point made there that immediate forces can 
often be determined independent of secondary ones. This independence is reflected in 
Figure 11.1, where the focus is on determining what, if any, secondary force applies. In 
that case (although not in all) the fact that the immediate force was ASK fell out from the 
first-pass match of the entry pattern for "Do you want..." forms. 

A discrimination procedure such as the one described will not always produce a single 
answer. Lexical ambiguities within the utterance might lead to several different 
alternatives. In addition, particular tests can lead to multiple alternatives, especially those 
that involve rich contextual structures such as role relationships. In cases such as these, 
where a discrimination procedure fails to sufficiently narrow the possible interpretations, the 
recognition mechanism can appeal to other sources of information. One useful source is the 
implications of particular ISA choices (e.g., with respect to politeness, formality of the 
environment, etc.). Another is the raw likelihood that a given form is used to convey an 
individual speech act or set of speech acts. The primary value of these sources would appear 
to be in ruling out interpretations. 


The next issue that must be addressed is the generality of the discrimination 
procedure associated with the four types of "Do you want..." utterances. Can the 
discrimination procedure be used for other types of ISA forms? In general, I think that the 
answer is no. Although discrimination procedures for ISA forms will often appeal to similar 
sorts of information and use similar tests, they differ in the following ways: 

1. Deletions from and additions to the set of speech act possibilities can alter 
dependence relationships, which in turn affect the order in which tests are done. 

2. Some forms are more commonly used for a speech act than others. Even where the 
same speech act set is possible for two patterns, differences in raw likelihood might 
affect the order in which tests are performed. 

At this point it is not clear exactly how many discrimination procedures will have to be 
written. The number of procedures is related to the number of possible ambiguities, and 
this number depends on the number of speech acts and the number of ISA forms known to 
a system. We are saved from the specter of infinitely proliferating discrimination 
procedures by the original intent of these procedures. Discrimination procedures are 
specially tailored to a specific systematic ambiguity. We need only write as many different 
discrimination procedures as there is specialized information. Where specialized 
information is lacking for an ambiguity, the entry pattern can have an associated list of 
possible speech act forces and a call to a general discrimination procedure. The core of this 
general procedure would be a check of all the preconditions of the possible speech act forces. 
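
A minimal Python sketch of this fallback arrangement follows. It assumes that a dialogue 
context can be reduced to a table of precondition names and truth values; the precondition 
names, the PRECONDITIONS table, and the EntryPattern class are all hypothetical and stand in 
for whatever representation a real system would use. 

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Optional

Context = Dict[str, bool]  # hypothetical: precondition name -> does it hold?

# Illustrative precondition lists only; not the preconditions given earlier in the paper.
PRECONDITIONS: Dict[str, List[str]] = {
    "REQUEST": ["p2_can_do_action", "p1_wants_action"],
    "OFFER": ["p1_can_do_action", "p2_benefits"],
}


def general_procedure(forces: List[str], context: Context) -> List[str]:
    """Keep each possible force whose preconditions all hold in the context."""
    return [f for f in forces
            if all(context.get(p, False) for p in PRECONDITIONS.get(f, []))]


@dataclass
class EntryPattern:
    name: str
    possible_forces: List[str]
    specialized: Optional[Callable[[Context], List[str]]] = None

    def discriminate(self, context: Context) -> List[str]:
        # Use the tailored procedure when one was written, else fall back.
        if self.specialized is not None:
            return self.specialized(context)
        return general_procedure(self.possible_forces, context)
```

An entry pattern built without a specialized procedure simply carries its list of possible 
forces and defers to general_procedure, so a new procedure is written only where specialized 
information exists. 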




This has been a good deal of detail, and a summary may be helpful before going on. 
First, in the SA -> ISA direction the recognition mechanism starts with one or more speech 
act expectations. The task is to find or derive the appropriate ISA patterns, evaluate these 
with respect to the current dialogue context, and match the utterance against them. The 
method and other semantic linkages discussed in Part 1 provide a way to organize the ISA 
patterns and thus make the SA -> ISA operation relatively straightforward. 

In the ISA -> SA direction the recognition mechanism starts with the ISA pattern, 
and its task is to identify the intended speech act or speech acts. For this, I have 
suggested discrimination procedures associated with entry patterns (or abstractions of 
them). These discrimination procedures capitalize on the systematic ambiguities displayed 
by ISAs. 


11.2. Recognizing Frozen ISA Forms 
Recall that frozen forms were defined in Section 14 as follows: 
A frozen ISA form is an ISA form with an immediate force that does not correspond to 
the surface form. For such an utterance, one or both of the following obtains: 
1. It can or does display eat by behavior atypical of the rest of the form. 


2. It can or does display cooccurrence behavior atypical of the rest of the form. 


To record this special behavior, frozen forms require separate patterns, distinct from 
patterns derived from general properties of speech acts and actions. Also, since we want to 
use different disambiguation schemes according to the direction of the match (a general 
disambiguation scheme for SA -> ISA and one specialized for ISA -> SA), we 


need one set of frozen form patterns for each direction. While this may sound redundant, I 
have found it to be useful, because other information related to the path-building process 
can be associated with the different sets of patterns. (The “key” representations and "way" 
methods in [2] constitute such a split in patterns for frozen forms.) 

Besides having patterns distinct from the general ISA patterns, frozen forms will often 
have specially designed discrimination procedures. One reason for this is that the special 
surface behavior of some frozen forms will eliminate one or more possible interpretations 
altogether. Consider, for example, the familiar pair 5.3 and 5.4: 

5.3 Can you please close the door? 

5.4 *Are you able to please close the door? 

Without please, 5.4 can be either an ASK only or secondarily a REQUEST; the please in 5.3 
eliminates the possibility that it was intended as an ASK. 

Even forms that do not show special behavior themselves may be more prone to 
certain interpretations, justifying the use of separate discrimination procedures. For 
example, 11.1 is more idiomatic as a REQUEST than as an ASK. To convey an immediate 
ASK force, a form such as 11.2 is generally preferable. 

11.1 Will you shut the door? 

11.2 Are you going to shut the door? 

(Example 11.2 could also convey a secondary REQUEST force, but that is not my point 
here.) 

Frozen ISA forms, then, require separate patterns and, in at least some cases, separate 
discrimination procedures. They also require special treatment in the recognition matching 
algorithm. Because frozen forms are special cases, it is reasonable to try to match them first 
(in a serial algorithm) or to assign a higher priority to a successful frozen form match (in a 
parallel algorithm). This policy could result in some non-frozen utterances being treated as 
frozen ones, e.g., a "Can you <action>?" form intended as immediate ASK, secondary 
REQUEST. This simplification does not seem to me to be serious. Often, information will 
be too sparse for P2 to determine how seriously P1 intends a question such as the "Can you 
<action>?" form. Since such a wide range of responses to ISAs is possible (as noted in 
Section 5.2), we are not restricting the flow of the dialogue by treating all such forms as 
frozen. 
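
The serial variant of this policy can be sketched in a few lines of Python. Regular 
expressions are used here only as a stand-in for ISA patterns, and the two pattern tables and 
the function name match_serial are hypothetical. 

```python
import re
from typing import Optional, Tuple

# Stand-in patterns: each pairs a surface form with the immediate force it is given.
FROZEN_PATTERNS = [
    (re.compile(r"can you (?:please )?(?P<action>.+)\?", re.IGNORECASE), "REQUEST"),
]
GENERAL_PATTERNS = [
    (re.compile(r"(?:can you|are you able to) (?P<action>.+)\?", re.IGNORECASE), "ASK"),
]


def match_serial(utterance: str) -> Optional[Tuple[str, str, str]]:
    """Try frozen patterns before general ones; return (kind, immediate force, action)."""
    text = utterance.strip()
    for kind, patterns in (("frozen", FROZEN_PATTERNS), ("general", GENERAL_PATTERNS)):
        for pattern, immediate_force in patterns:
            found = pattern.fullmatch(text)
            if found:
                return kind, immediate_force, found.group("action")
    return None
```

With these tables, "Can you close the door?" is always taken as frozen even though the 
general pattern would also match it, which is exactly the simplification discussed above. 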


11.3. Recognizing Implicit-Action ISAs 
In this subsection, we consider the problem of integrating the recognition of implicit 
actions in with other aspects of the recognition of ISA forms. Discussion will concentrate on 
the "when" of implicit-action processing. Section 11.3.1 considers the treatment of ISAs 
without tertiary forces; the special problems associated with tertiary forces are discussed in 
Section 11.3.2. 
in identifying a particular implicit action. To a large extent, this process is dependent on 
general solutions to knowledge representation problems; it is not a process unique to ISAs. 
Nevertheless, issues of search strongly motivated the distinction among ISA types in Section 
7.5. Similarly, the tie-in between the rules in Sections 6 and 7 and the action representation 
in Section 4 was partly motivated by the search problem. Also relevant to this problem is 
the discussion of knowledge sources in Section 12; these sources are expected to guide and 
constrain the search. 




11.3.1. Integrating Implicit Actions into the ISA Mechanism 

As usual, the ISA recognition problem will be broken down according to the direction 
of match, and, as usual, the SA -> ISA direction is the easier one to describe. Recall that 
the implicit-action rules given in Section 7.5 combined a pattern intended to match the 
surface semantic representation of the utterance with a pattern built from a general ISA rule 
(Section 6). That is, implicit-action rules can be thought of as beginning where explicit- 
action rules leave off. 

The straightforward approach to the SA --> ISA match for implicit actions starts with 
the explicit-action patterns evaluated with respect to the current dialogue environment. 
Implicit-action rules are applied to these evaluated patterns. In the course of this process, 
some implicit-action rules will be found to be inapplicable, and the others will be evaluated. 
The utterance can then be matched against these evaluated implicit-action patterns. Both 
explicit-action patterns and implicit-action ones, evaluated with respect to the current 
dialogue context, could be matched in whatever order or combination desired, allowing, of 
course, for the dependence of implicit-action forms on explicit-action ones. 

There are several possible variants to this approach to matching implicit-action 
patterns; the order of evaluation and rule application depends to a high degree on the 
general recognition scheme chosen. Whatever the final choice, however, the important 
point about the SA --> ISA direction is that patterns evaluated with respect to the current 
dialogue environment can be quite specific. In many cases, a particular implicit action will 
already be represented in the evaluated pattern. For example, utterance 10.1 in the context 
discussed in Section 10 would match a SA --> ISA pattern that explicitly contained both a 
representation for "signature" and a representation for a signing action. 

10.1 I need your signature. 

Thus for such examples there is no need to search for an implicit action. For other SA --> 
ISA examples, the decision about whether a particular action is intended and the search for 
the action can be done as part of the pattern match. 

Turning now to the ISA -> SA direction, we have the problem of integrating implicit 
actions into the discrimination procedure mechanism used for both general ISAs and frozen 
forms. The principal question is at what point in the analysis the choice of -- or search for 
-- the implicit action should take place. This question requires careful analysis because the 
process of identifying an implicit action appears to be an inherently expensive one. For 
most of the implicit-action types, the process is heavily dependent on a wide range of 
knowledge sources, where the exact source is not predictable beforehand. 

Identification of implicit actions can either be done at a fixed place in the bottom-up 
recognition process or at different places in the process depending on the speech act 
possibilities involved. The obvious candidate for a fixed place is the first matching pass. 
That is, the implicit action would be determined prior to the discrimination procedure as 
part of matching the entry pattern. The evidence, however, points against identifying 
implicit actions during the initial match pass. First, the implicit-action rules as written 
would not be easily applicable to entry patterns. This is because implicit-action rules are 
specific to individual speech acts, and entry patterns are not. Even if some implicit-action 
rules were generalized, the link between implicit actions and individual speech acts goes 
deep. For example, 11.3 can be either an OFFER by P1 to give P2 a pen or a REQUEST 
that P2 give P1 a pen. 68 

11.3 Do you have a pen? 


The choice between these two alternatives can only be resolved within the discrimination 
procedure. There appears to be no advantage to generating both representations early and 
carrying them forward. 

A similar problem arises for forms where implicit action ambiguities occur for a single 
speech act force. For example, consider the following form: 

11.4 Do you have <INSTRUMENT> for <action>? 

If the AGENT of the action is not specified, then there are two different REQUESTs 
among the possible interpretations. The first possibility is a REQUEST that P2 give P1 
the INSTRUMENT, and the second possibility is that P2 take responsibility for carrying 
out the action using the INSTRUMENT. 

Given the close relationship between implicit action choice and speech act force, then, 
it appears that the choice of implicit action should occur within the appropriate 
discrimination procedures, under explicit control of the system builder. 
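
The division of labor argued for here can be illustrated with a short Python sketch for the 
"Do you have a pen?" example. The first pass binds only the object; the implicit action is 
chosen together with the speech act force inside the discrimination step. The function names 
and the two control flags are assumptions made for illustration, loosely modeled on the 
ownership and control test for OFFER. 

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass(frozen=True)
class Reading:
    force: str            # secondary force conveyed along with the immediate ASK
    implicit_action: str  # action left unstated in the utterance


def match_entry_pattern(utterance: str) -> Optional[str]:
    """First pass: bind the object of "Do you have <object>?"; choose no implicit action."""
    prefix, suffix = "do you have ", "?"
    text = utterance.strip().lower()
    if text.startswith(prefix) and text.endswith(suffix):
        return text[len(prefix):-len(suffix)]
    return None


def discriminate(obj: str, p1_controls: bool, p2_controls: bool) -> List[Reading]:
    """Choose force and implicit action together, from who controls the object."""
    readings = []
    if p1_controls:
        readings.append(Reading("OFFER", f"P1 gives P2 {obj}"))
    if p2_controls:
        readings.append(Reading("REQUEST", f"P2 gives P1 {obj}"))
    return readings
```

An empty result means that neither secondary reading is supported and only the literal ASK 
remains. 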


11.3.2. Recognizing Utterances with Tertiary Forces 


As was the case for other ISA classes, the fact that utterances can convey a tertiary 


68. For an OFFER, example 11.3 is derived by an application of an implicit-action rule 
analogous to Rule 7.4 to the result of the general ISA rule 6.1 Clause iv. For REQUEST, 
example 11.3 comes from the application of the implicit-action rule 7.2 to the result of the 
application of the general ISA rule 6.4 to Precondition (I!). 




speech act force raises not only the problem of recognizing when such a force is intended, 
but also the problem of recognizing when such a force is not intended. Recall that among 
the ISA types considered here, type 3 implicit-action forms are the only ones that may 
systematically convey tertiary forces. 

5.18 Close the door. 

5.19 It's cold in here. 

5.20 Must I tell you that it's cold in here? 

Using our standard examples, recall that in an appropriate context 5.20 has three forces: an 
immediate force of ASK that corresponds to the surface form, a secondary force of a 
representative that conveys information like that in 5.19, and a tertiary force of REQUEST 
that conveys information like that in 5.18. 

In the SA -> ISA direction, tertiary speech act forces present no special problems. In 
addition to the normal match of ISA patterns outlined in the first subsection, the Interpreter 
can apply the appropriate rules a second time to type 3 forms to find the possible double 
indirection patterns. A match of one of the patterns so produced -- with accompanying 
identification of an implicit action -- indicates that a tertiary force is a possibility. After a 
successful match of a triple-force utterance, the Interpreter builds three Event Tree paths, 
one for each force. 
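
The idea of applying rules a second time can be sketched as follows in Python. The two rule 
functions are hypothetical stand-ins hard-wired to the standard examples; they are not the 
ISA rules of Sections 6 and 7, and the pair representation of a speech act is an assumption. 

```python
from typing import List, Optional, Tuple

Act = Tuple[str, str]  # (force, content); a deliberately crude representation


def request_to_state(act: Act) -> List[Act]:
    """Hypothetical implicit-action rule: a REQUEST conveyed by stating a reason."""
    if act == ("REQUEST", "close the door"):
        return [("STATE", "it's cold in here")]
    return []


def state_to_ask(act: Act) -> List[Act]:
    """Hypothetical rule: a STATE conveyed by asking whether it must be said."""
    force, content = act
    if force == "STATE":
        return [("ASK", f"must I tell you that {content}?")]
    return []


RULES = [request_to_state, state_to_ask]


def indirection_chains(expected: Act, depth: int) -> List[List[Act]]:
    """All chains [expected, ..., surface act] using 1 to `depth` rule applications."""
    frontier, found = [[expected]], []
    for _ in range(depth):
        frontier = [chain + [derived]
                    for chain in frontier
                    for rule in RULES
                    for derived in rule(chain[-1])]
        found.extend(frontier)
    return found


def match(utterance: Act, expected: Act) -> Optional[List[str]]:
    """Return the forces, immediate force first, or None if no pattern matches."""
    for chain in indirection_chains(expected, depth=2):
        if chain[-1] == utterance:
            return [force for force, _ in reversed(chain)]
    return None
```

A chain of length three corresponds to a triple-force utterance, so a caller could build one 
path per element of the returned list. 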

Once again, the ISA -> SA direction is more difficult. The main problem presented 
by a double indirection in bottom-up processing is that a decision about whether the first 
possible indirection has been taken (e.g., ASK to the representative in 5.20 above) can be 
contingent on processing related to a second indirection (e.g., a representative to 
REQUEST). More specifically, the existence of an action appropriate to be an implicit 
action for a type 3 REQUEST form could be evidence not only for the second indirection, 
but also for the first. This means that we definitely cannot think of the bottom-up 
recognition process for double indirections as two subprocesses acting independently of each 
other. The decision about whether an utterance such as 5.20 is intended as a direct ASK, 
an ASK plus an indirect representative, or an ASK plus a representative plus a REQUEST 
may be computationally messy because processing cannot be subdivided in all cases. 

Although double indirection clearly poses special problems, I do not want to give the 
impression that the situation is hopeless. Note first that the analysis phase described in the 
last subsection uses general procedures. The problem, then, is not finding a way to handle 
double indirection within this framework but, rather, finding a good way. It appears that 
the answer to most of our problems can be found in the data itself. In the transcripts that I 
have examined, the proportion of type 3 ISAs to other forms is small; moreover, of the type 
3 forms, only a subset are based on a double indirection. More important for bottom-up 
processing, only a handful of different forms need to be considered as possible double 
indirections (whether or not the utterances actually are). 

Beyond simple numbers, note that the literal interpretation of double indirections 
appears to require strong contextual support. Consider, for example, 11.5 and 11.6: 

11.5 Do you know that it's cold in here? 

11.6 I want you to know that it's cold in here. 

Without a specialized context (e.g., a psychological experiment) it is difficult to imagine 11.5 
or 11.6 uttered as an initiator with only the literal interpretation intended. 


The need for special contextual support is the most important property of the data for 
bottom-up processing, because it means that outside of such contexts the first indirection can 
be assumed with relative safety. The task then becomes to determine whether or not a 
tertiary force applies, e.g., whether 11.5 is an ASK and a STATE or an ASK, a STATE, and 
a REQUEST. This task is similar to the processing of the possible single indirection, "It's 
cold in here." So, while in theory double indirections are more difficult computationally, it 
appears that in practice only rare -- and recognizable -- cases require more processing 
power than a comparable single indirection. 


12. Knowledge Sources for ISA Recognition 


In this paper I have emphasized the importance of context in the 
interpretation of ISAs. While a complete account of the role of context in ISA recognition is 
not possible at this point, we can begin to identify the knowledge sources involved. This 
section examines the following six types of knowledge: 


1. domains of discourse 

2. complementary role relationships 

3. special properties of P1 

4. special properties of P2 

5. exceptional circumstances 

6. the course of the interaction to date 


These six knowledge sources play an important role in the interpretation of implicit-action 
ISA forms, e.g., in identifying the intent of "It's cold in here" uttered as a REQUEST for P2 
to shut the door. Although implicit-action forms will be used as examples in this section, 
this is not the only place that the six knowledge sources can be used. The sources also play 
a role in supplying semantic case information that is left implicit in explicit-action forms. 
For example, the INSTRUMENT is left unspecified in example 12.1, but if P1 uttered 12.1 
while handing P2 a bread knife, then the case assignment would be clear. 


12.1 Will you cut the bread? 


In addition, the six knowledge sources can supply information to disambiguate the forces of 
an ISA, for example the possible readings of a "Do you want..." form discussed in Section 11.1. 


Two additional observations must be made about the list above. First, the six types 
of knowledge are not to be seen as independent; we can expect interaction between 
sources, especially between any of (2) through (6) and any of the other sources on the list. 
Second, recall that I am assuming that P1 and P2 share a world model with respect to this 
knowledge. This assumption will not be correct in all cases. Sometimes ISAs uttered by P1 
will be interpreted by P2 in terms of P2's view of P1's view of the six knowledge sources. 


12.1. Domains of Discourse 


In this section, domain is meant to include: 


1. the spatial environment: locations = as specs attics: 
2. the social environment: e.g., public transportation, parties, school lectures 


3. the general task environment: eg., eating a meal, running a business 69 


In talking about the relation of domains to the interpretation of ISAs, I wish to make a case 
for the existence of routine actions associated with particular domains. By routine I wish to 
emphasize the concepts that appear in dictionary definitions, such as prescribed, regular, 
customary, habitual. That is, routine actions associated with particular states in particular 
domains are not actions that are arrived at by some creative or "planning" process. 
They are rather standard actions whose indications for use are common knowledge. 


69. I specify general in the task environment, because specific tasks will be discussed 
separately in Section 12.6. General task environments are activities that share a collection 
of independent tasks, i.e., tasks that are not subtasks of each other. 


To illustrate, I return to "It's cold in here" used as an indirect REQUEST for a 
particular action. In an average home, the routine cold-weather responses to this utterance 
might include: 

1. check if windows or doors are open or are admitting drafts 

2. check to see if more sun can be let in 

3. turn up heat 

4. add clothes, blankets, etc. to anyone who is cold 

The strict ordering of the alternative actions is not obligatory in all cases, but it is 
appropriate in some. Note that such routine actions can be associated with very specific 
domains. Another household might use the same responses but order them differently. A 
third household might include throwing a log on the fire as one of the actions. For this 
example, the domain may be not only a particular home but even a particular room. To 
represent the routine actions of a domain, we can group them by the goal(s) that they 
achieve and write choice methods for each cluster. These choice methods will contain the 
known tradeoffs between the actions, as well as any ordering that is appropriate. 
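
One simple way to picture such clusters and choice methods is the Python sketch below. The 
RoutineCluster class, the goal name "get warm", and the DOMAINS table are hypothetical; the 
choice method shown handles only ordering and feasibility, and it leaves out the tradeoffs a 
fuller choice method would record. 

```python
from dataclasses import dataclass
from typing import Callable, Dict, List


@dataclass
class RoutineCluster:
    goal: str
    actions: List[str]  # listed in the domain's preferred order, if it has one

    def choose(self, possible: Callable[[str], bool]) -> List[str]:
        """Choice method: keep the domain's order and drop actions that are not possible."""
        return [action for action in self.actions if possible(action)]


# Routine actions indexed by domain and then by the goal they achieve.
DOMAINS: Dict[str, Dict[str, RoutineCluster]] = {
    "average home": {
        "get warm": RoutineCluster("get warm", [
            "check windows and doors",
            "let in more sun",
            "turn up heat",
            "add clothes or blankets",
        ]),
    },
}
```

A different household would get its own entry in DOMAINS, with the same actions in another 
order or with extra actions such as putting a log on the fire. 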

Returning now to speech acts, one way to communicate an implicit action (or other 
implicit information) in an ISA is to draw on the routine actions associated with the 
particular domain. If a particular action is intended by P1, it must be distinguishable from 
among this set of possibilities. Otherwise, the ISA is not successful. Where the alternative 
actions have a strong a priori ordering, we can filter the alternatives according to what is 
possible in the domain and add criteria provided by the other knowledge sources discussed 
in this section. Where there is no particular order of choice, either other knowledge sources 
should be applicable or other routine actions should be inapplicable in the environment. 
Assuming that an action was intended, if the implicit action is still ambiguous, then either 
P2 has too rich a set of possibilities or P1 has been vague. 


12.2. Complementary Roles of P1 and P2 

Another source of information for clarifying ISAs is the role, i.e., a set of actions 
which may be associated with a dialogue participant. In this subsection, I am considering 
only complementary role relationships, e.g., doctor/patient or conductor/passenger. Roles of 
dialogue participants that are not complementary can also affect the interpretation of ISAs; 
they are included under properties of P1 and P2 (Sections 12.3 and 12.4). Like domains, 
roles exhibit different levels of aggregation. On an assembly line, a role might be a 
relatively small set of actions or even a single one, such as tightening a particular bolt. The 
role for, say, a homemaker could contain many more actions, possibly grouped into related 
sets such as cooking, house cleaning, home repair, and so forth. Here, homemaker is what I 
will call a role name; the word role will be reserved for the action set. 

Roles are particularly important sources of information about ISAs in task-oriented 
environments, where formal job descriptions often exist and where labor is often explicitly 
divided. They also come into play in more informal environments, where tasks and 
responsibilities are divided by custom, by social role, or by agreement. 

An example of an implicit-action ISA based on a complementary role relationship is 
12.2 when addressed to P2, an auto mechanic (a role name) at P2's place of business (a 
domain). 


12.2 The clutch on my van is gone. 


Under the conditions specified, 12.2 can count as a REQUEST that the mechanic fix or 
replace the. auto part, in this case the clutch. This is one of the actions in an auto 
mechanic's role. Utterance 12.2 can also have the force of a simple STATE, for example if 
it is directed to a friend encountered while waiting in line. For this example in this 
environment, it takes the mechanic's role to make an appropriate action, and hence a 
REQUEST sense, clear. To ask the friend met in line for a ride, one would generally need 
a more specific REQUEST form in addition to 12.2. 

One question that must be asked about roles is what effect multiple roles have on the 
interpretation of ISAs. Participants in a dialogue often share more than one complementary 
relationship. While the domain provides some roles (e.g., host or hostess at a party), 
other roles are strictly from the relationship of P1 and P2. These multiple relationships do 
not seem to confuse the interpretation of ISAs as much as one might expect. Consider an 
example in which P2 is both P1's doctor and P1's friend. If P1 describes a pain to P2 at a 
party, it is probable that P1 is REQUESTing more than sympathy; very likely the appeal is 
to P2's special expertise and the implicit action is a diagnosis. Where more than one 
complementary role relationship exists, the default seems to be to appeal to the role with the 
most powerful action in the area of the REQUEST. This is not to say, however, that 
confusion cannot occur, and in this case, a clarifying question can be asked. 


70. This is subject, of course, to constraints. Example 12.2 uttered to a mechanic at a party 
would seldom be seen as a REQUEST to replace a clutch right then and there. This 
reading could be ruled out by the absence of the necessary tools and parts, by the party 
domain (because it conflicts with standard party activities), or by some other condition. 
Note that the first two conditions named correspond to the second blocking question that 
leads to an ASK interpretation in the example worked out in Figure 11.1. 


12.3. Properties of P1 

While complementary roles relate to the mutual relationship of P1 and P2, properties 
of P1 independent of the relationship can also affect the interpretation of an ISA. One 
important case mentioned above is the non-mutual role, such as an appeal to the medical 
expertise of P2 where P1 is not a patient. Another important class of properties includes the 
likes and dislikes of P1. Consider, for example, 12.3 uttered as a REQUEST: 


12.3 I have to be in Cleveland by 9:00 tomorrow morning. 


Let us assume that 12.3 is uttered in an office (a domain) to a secretary (a role name). Let us 
assume further that the routine response to such a REQUEST in this office is to make 
travel arrangements, in particular to book a flight if the distance is sufficient. Consider 
the case, however, in which P1 is afraid to fly and has made P2 aware of this fact. In this 
case, some other routine action may be understood, such as making train reservations. This 
default would be associated with P1. 

In general, properties of P1 may block alternative actions suggested by other 
knowledge sources (and hence perform a disambiguation function), they may suggest 
new alternatives, or they may cause a reordering of existing alternatives. 
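
The three effects named here (blocking, suggesting, and reordering) can be shown in a small 
Python sketch. The ParticipantProperties record is hypothetical, and "rent a car" is an 
invented extra alternative added only so that the reordering is visible. 

```python
from dataclasses import dataclass, field
from typing import List, Set


@dataclass
class ParticipantProperties:
    blocked: Set[str] = field(default_factory=set)       # actions ruled out for this person
    suggested: List[str] = field(default_factory=list)   # new alternatives this person brings
    preferred: List[str] = field(default_factory=list)   # actions to move to the front, in order


def apply_properties(alternatives: List[str], props: ParticipantProperties) -> List[str]:
    """Block, extend, then reorder the alternatives supplied by other knowledge sources."""
    kept = [a for a in alternatives if a not in props.blocked]
    kept += [a for a in props.suggested if a not in kept and a not in props.blocked]
    rank = {action: i for i, action in enumerate(props.preferred)}
    # sorted() is stable, so actions without a preference keep their relative order.
    return sorted(kept, key=lambda a: rank.get(a, len(rank)))


afraid_to_fly = ParticipantProperties(
    blocked={"book a flight"},
    suggested=["make train reservations"],
    preferred=["make train reservations"],
)
```

Applied to the office routine for 12.3, apply_properties(["book a flight", "rent a car"], 
afraid_to_fly) drops the flight and puts the train reservation first. 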
12.4. Properties of P2 

Properties of P2 that are common knowledge to P1 and P2 can also have an effect on 
the interpretation of an ISA. 


12.4 Tabby needs a new home. 


If P2 is allergic to cats, P1 knows it, and P2 knows that P1 knows, then 12.4 will not 
(assuming the good will and good memory of P1) be a REQUEST that P2 adopt Tabby. 

In general, properties of P2 are of a type and behavior similar to properties of P1. 
There is, however, an added dimension. In deciding whether an interpretation is affected 
by properties of P2, P2 must determine both whether P1 knows about the property and 
whether P1 seems to be aware of it at the time of the REQUEST. To make these decisions, 
P2 will usually be operating with highly incomplete knowledge. If there is any question of 
the right choice, and if anything significant rides on the choice, P2 will often state the 
property (e.g., 12.5 in response to 12.4) as a verification of the choice. 


12.5 If I weren't allergic to cats, I'd be happy to take Tabby. 


12.5. Exception Conditions 

The last four subsections have enumerated sources of knowledge, with the stated 
assumption that standard conditions prevailed. .. Interpretations of ISAs can be affected — 
dramaticaily, however, by the existence of exception conditions. : Exception conditions are 
not a totally separate source of knowledge but are integrated into all four categories 
discussed so far: domain, role, and the special properties of Pl and P2. 

‘An example of an appeal to a domain-related. exception condition is 126 uttered at 


hore in a power failure. 
126 It's dark in here. 


If Pl is aware of the power failure, then it is very unlikely that 126 isa REQUEST that P2 


a 


4 


flip the tight switch. Due to the exception ‘céndition, in-this partiastar domain getting a 
flashlight might be the preferred action’(as' well as the one intended by Pl): 

Exception conditions are not limited to domains. An exception condition in a role
might arise when a piece of machinery is broken and a different manufacturing procedure
must be used. Finally, for P1 and P2, an exception condition might be a broken leg.

Exception conditions not only block or reorder standard alternatives; they may also
bring totally new sets of alternatives of their own. For example, the use of a flashlight,
while a solution to darkness in a standard situation, is not a routine solution in most
domains. Note that an implicit action, or other aspect of an ISA, can only be "clear from
context" in an exception situation when the exception condition has its own routine actions
associated with it. We are still talking, then, about non-creative, routine actions. The
difference is that these actions are associated with non-routine conditions.
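
The way an exception condition blocks standard alternatives and contributes routine
actions of its own can be illustrated with a minimal Python sketch. This is not the OWL-I
representation; the table STANDARD_ACTIONS, the class ExceptionCondition, and the
function candidate_actions are hypothetical names introduced only for illustration, and
the action lists are invented for the "It's dark in here" example.

```python
from dataclasses import dataclass, field

# Hypothetical routine remedies, indexed by (domain, undesirable state).
STANDARD_ACTIONS = {
    ("home", "dark"): ["flip the light switch", "open the curtains"],
}


@dataclass
class ExceptionCondition:
    """An exception condition with its own routine actions (illustrative only)."""
    name: str
    blocked: set = field(default_factory=set)   # standard actions it rules out
    added: list = field(default_factory=list)   # routine actions it brings along


POWER_FAILURE = ExceptionCondition(
    name="power failure",
    blocked={"flip the light switch"},
    added=["get a flashlight"],
)


def candidate_actions(domain, state, exceptions=()):
    """Return candidate implicit actions, most preferred first."""
    actions = list(STANDARD_ACTIONS.get((domain, state), []))
    for exc in exceptions:
        # Block standard alternatives that the exception makes unworkable ...
        actions = [a for a in actions if a not in exc.blocked]
        # ... and put the exception's own routine actions ahead of the rest.
        actions = list(exc.added) + actions
    return actions


print(candidate_actions("home", "dark"))
print(candidate_actions("home", "dark", [POWER_FAILURE]))
```

Under these assumptions the flashlight becomes the preferred candidate only when the
power failure is known, which is the sense in which the action remains routine even
though the condition is not.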


12.6. The Course of the Interaction to Date


This brings us to the last category, the effect of the interaction -- both linguistic and
non-linguistic -- between P1 and P2. Several different things may occur here. First,
information may be gained in the course of the dialogue that directly pertains to the
categories already discussed. P1 may explicitly specify the domain, e.g., "Let's view this
problem geometrically." One of P1's roles may also be specified explicitly, e.g., "Speaking as
the owner of this property, I want you to know that you're trespassing." Similarly, P1- or
P2-based properties may be discussed explicitly in the dialogue, or an exception condition
may be noted.

Interpretations of ISAs may also be affected by specific tasks that are now or were
previously underway in the course of the current interaction, as well as the goal structures of
P1 and P2 revealed in the dialogue.⁷¹ An example of the effect of a specific task is the
interpretation of example 12.7.


12.7 We have twenty cartons of paper clips. 


If the domain is an office and the specific task is taking inventory, then 12.7 can be
interpreted as a simple representative. If the task is rearranging the supplies, then 12.7
lends itself naturally to interpretation as a REQUEST for advice on where to put things or
for help in moving them. An example of the effect of discussion about goals is 12.8, which
can be seen as part of an extended exchange related to planning the day.

12.8
12.8.1 P1: I think I'll go to the A&P today.
12.8.2 P2: We're out of flour.

It is reasonable to construe P2's utterance in 12.8 as a REQUEST that P1 buy flour on the
planned shopping trip. To see that this interpretation of 12.8.2 is dependent on 12.8.1,
compare the following:

12.9
12.9.1 P1: Look at the snow. Let's bake a cake.
12.9.2 P2: We're out of flour.


The most obvious interpretation of 12.9.2 is as a representative acting as a rejection of the 
suggestion in 12.9.1. 
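
The dependence of these interpretations on the current task and on goals revealed in the
dialogue can be made concrete with a small Python sketch. It is a hand-coded lookup over
the examples above, not a recognition procedure; the function name interpret and the
context keys "task" and "pending_plan" are assumptions made only for illustration.

```python
def interpret(utterance, context):
    """Return a (speech act, gloss) pair for a few hand-coded cases.

    `context` is a dict that may carry the current task and a plan or
    suggestion introduced earlier in the dialogue (hypothetical keys).
    """
    task = context.get("task")
    plan = context.get("pending_plan")

    if utterance == "We have twenty cartons of paper clips.":
        if task == "taking inventory":
            return ("REPRESENTATIVE", "report a count")
        if task == "rearranging supplies":
            return ("REQUEST", "advice or help with the cartons")

    if utterance == "We're out of flour.":
        if plan == "go shopping":        # after 12.8.1
            return ("REQUEST", "buy flour")
        if plan == "bake a cake":        # after 12.9.1
            return ("REPRESENTATIVE", "reject the suggestion")

    # With no relevant context, fall back to the direct reading.
    return ("REPRESENTATIVE", None)


print(interpret("We're out of flour.", {"pending_plan": "go shopping"}))
print(interpret("We're out of flour.", {"pending_plan": "bake a cake"}))
```

The same surface string is mapped to different speech acts purely on the basis of the
context argument, which is the point of examples 12.7 through 12.9.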
71. I am assuming that goals and tasks from previous dialogues and other shared
experiences would already be assimilated and would be accessible as properties of the
participants.




Another way that the interaction to date may affect the interpretation of ISAs is in
discussion of the preconditions of actions. Consider, for example, 12.10.

12.10
12.10.1 P1: Do you play dominoes?
12.10.2 P1: I need a partner for the mixed doubles tournament on Friday.

The presence of 12.10.1 clarifies the interpretation of the particular action in 12.10.2: P1 wants
P2 to agree to be P1's partner in the tournament. Without this preliminary remark, the
particular action would be considerably less clear. Utterance 12.10.2 might just as well be a
REQUEST for advice or for sympathy.

Although I am not attempting to describe how the dialogue affects the interpretation
of ISAs, a case could be made for viewing the examples in this subsection as derivable from
methods (Section 4) and the Event Tree and recognition framework (Section 9). This is a
natural approach, because the Event Tree is one record of the linguistic and non-linguistic
interaction between P1 and P2. Moreover, methods can provide strong connections between
utterances and strong expectations about conditions in the environment. We can therefore
expect methods and the Event Tree to be useful in modelling many (although not all) of the
interactions between P1 and P2 that can have an effect on ISA interpretation.


For P1 to construct a well-formed indirect speech act, there must be a reasonable
chance that P2 will interpret the utterance as intended. In general, it is only by an appeal
to shared knowledge that a correct interpretation becomes possible. The six knowledge
sources discussed in this section give us a start toward a catalogue of the information
common to dialogue participants (or, more precisely, the information that one reasonably
expects will be common). The knowledge can come out of shared experience, out of
communication between participants, or from the standards individuals and institutions
impose to make certain actions and knowledge "routine".




13. In Conclusion 


In this paper I have presented an account of a large number of ISA forms within 
task-oriented dialogue. While the account was strongly motivated by computational 
considerations, non-computational arguments were used to justify many of the distinctions 
made. The major characteristics of the approach that I have taken are as follows: 

-- Starting from the observation of traditional speech act theory that language is
action, I have advocated a uniform representational scheme for speech acts and
non-linguistic actions. The foundation of the representation is the OWL-I method.

-- Speech acts and other actions were combined into larger patterns of action, also
represented by OWL-I methods. These larger patterns were used as a partial model of
the structure of dialogue. This gave a framework in which to discuss issues of ISA
recognition.

-- The central processing assumption has been that the phenomenon of indirect
speech acts is too complex to admit of a single, uniform computational treatment. It is
necessary, then, to identify classes of indirect speech acts that share similar
computational properties and use different representations and processing strategies
for the different classes.

-- I have claimed that a process model of ISA usage needs at least the following:

1. rules based on general properties of speech acts and other actions

2. patterns that represent special syntactic and cooccurrence behavior for
frozen ISA forms

3. rules that embody relationships that can be used in referring to implicit
actions

There is much more that needs to be done before a full process model of ISAs is
achieved. First, there are the areas that were explicitly excluded, among them, hedged
performatives and "second order" ISAs, implications of ISA choices, and the generation of
ISA forms. Second, more in-depth analysis is needed for individual speech acts. Some of
these that are particularly interesting are the speech acts for stating facts, beliefs, and
opinions. Despite a great deal of attention from philosophers and some linguists, there is
still much that can be done in this area. Third, notions of context need to be more fully
worked out. A fourth major area is reasoning for disambiguation; we are far from a
complete understanding of how to block competing, but incorrect, interpretations of ISAs.

Finally, I would like to comment on the implications of all of this for human-machine
dialogue. Judging from the open problems I have listed, one might expect that computer
systems that reliably recognize ISAs would be many decades away. I think, however, that a
partial solution to the problem is much more accessible to us. Using the speech act
categorization developed here, it is possible to identify useful subsets of ISAs. This already
is an advantage, since it is often very difficult to identify useful subsets of English;
frequently implementers find themselves in what amounts to an "all or nothing" situation.
Moreover, given the categorization, it will be easy to describe what a system actually does
for ISAs. Since the categories were justified to a large extent by surface English behavior,
they should appear natural to speakers of English. Again, this is an area in which
implementers have had only mixed success.

Another way in which the implementer's job can be greatly simplified is by carefully 
restricting the speech acts accepted and the tasks available for discussion. (Both restrictions, 
of course, must be made clear to the user.) Limiting the types of speech act accepted means 
that some secondary forces are simply not possible in the environment. With only a few 
tasks handled, it is not as difficult to identify an implicit action; fewer tasks mean that the 


task intended will usually be “clear from context.” 




Thus, while many problems remain to be solved, I feel that not only is a
computational theory of ISAs well on its way, but we are at the point where ISA processing
can be done cleanly, if not yet completely, in computer dialogue systems.


References 


[1] Allen, J. A plan-based approach to speech act recognition, Technical Report No. 131/79
(February 1979), Department of Computer Science, University of Toronto, Toronto,
Canada.

[2] Austin, J.L. How to Do Things with Words, Oxford University Press, New York, 1962.

[3] Brown, G.P. A framework for processing dialogue, LCS TR-182 (June 1977),
Laboratory for Computer Science, M.I.T., Cambridge, MA.

[4] Brown, G.P. Linguistic and situational context in a model of task-oriented dialogue, to
appear in L. Vaina and K.J.J. Hintikka (eds.): Models of Dialogue, D. Reidel,
Dordrecht, Holland.

[5] Bruce, B.C. Belief systems and language understanding, BBN Report No. 2973
(January 1975), Bolt, Beranek and Newman, Inc., Cambridge, MA.

[6] Cohen, P.R. On knowing what to say: planning speech acts, Technical Report No. 118
(January 1978), Department of Computer Science, University of Toronto, Toronto,
Canada.

[7] Davison, A. Indirect speech acts and what to do with them, in: Cole and Morgan
(Eds.) Syntax and Semantics, vol. 3, Academic Press, New York, 1975.

[8] Deutsch, B.G. The structure of task oriented dialogs, Proceedings IEEE Speech
Symposium, Carnegie-Mellon University, Pittsburgh, PA, April 1974.

[9] Fraser, B. Hedged performatives, in: Cole and Morgan (Eds.) Syntax and Semantics,
vol. 3, Academic Press, New York, 1975.

[10] Gordon, D. and Lakoff, G. Conversational postulates, in: Cole and Morgan (Eds.)
Syntax and Semantics, vol. 3, Academic Press, New York, 1975.

[11] Grice, H.P. Logic and conversation, in: Cole and Morgan (Eds.) Syntax and
Semantics, vol. 3, Academic Press, New York, 1975.

[12] Grosz, B. The representation and use of focus in a system for understanding dialogs,
Proceedings of the Fifth International Joint Conference on Artificial Intelligence,
Cambridge, MA (August 1977).

[13] Hawkinson, L. The representation of concepts in OWL, Advance Papers of the Fourth
International Joint Conference on Artificial Intelligence, Tbilisi, Georgia, USSR
(September 1975).




[14] Labov, W. and Fanshel, D. Therapeutic Discourse, Academic Press, New York, 1977.

[15] Lakoff, R. The logic of politeness; or, Minding your p's and q's, Papers from the ninth
regional meeting of the Chicago Linguistic Society, University of Chicago Department
of Linguistics, Chicago (1973).

[16] Levin, J. and Moore, J. Dialogue games: Meta-communication structures for natural
language interaction, Cognitive Science Vol. 1, No. 4, October 1977.

[17] Long, W.J. A program writer, LCS TR-187 (November 1977), Laboratory for Computer
Science, M.I.T., Cambridge, MA.

[18] Martin, W.A. OWL: A system for building expert problem solving systems involving
verbal reasoning, 6.871 Course Notes, unpublished, M.I.T., Cambridge, MA (1974).

[19] Minsky, M. A framework for representing knowledge, in: Winston (Ed.), Visual
Information Processing, MIT Press, Cambridge, MA, 1975.

[20] Moore, J.A., Levin, J.A. and Mann, W.C. A goal-oriented model of human dialogue,
American Journal of Computational Linguistics, Microfiche 67 (1977).

[21] Morgan, J.L. Two types of convention in indirect speech acts, in: Cole (Ed.) Syntax
and Semantics, vol. 9, Academic Press, New York, 1978.

[22] Perrault, C.R., Allen, J. and Cohen, P. Speech acts as a basis for understanding
dialogue coherence, Theoretical Issues in Natural Language Processing-2, University
of Illinois at Urbana-Champaign, Urbana, IL (July 1978).

[23] Rubin, A. A framework for comparing language experiences, Theoretical Issues in
Natural Language Processing-2, University of Illinois at Urbana-Champaign,
Urbana, IL (July 1978).

[24] Sadock, J.M. Toward a Linguistic Theory of Speech Acts, Academic Press, New York,
1974.

[25] Sadock, J.M. On testing for conversational implicature, in: Cole (Ed.) Syntax and
Semantics, vol. 9, Academic Press, New York, 1978.

[26] Sag, I. and Liberman, M. The intonational disambiguation of indirect speech acts,
Papers from the ninth regional meeting of the Chicago Linguistic Society, University of
Chicago Department of Linguistics, Chicago (1979).

[27] Schank, R. and Abelson, R. Scripts, Plans, Goals, and Understanding, Lawrence
Erlbaum Associates, Hillsdale, NJ, 1977.




[28] Schegloff, E. and Sacks, H. Opening up closings, in: Turner (Ed.) Ethnomethodology,
Penguin Books, Middlesex, England, 1974.

[29] Searle, J.R. Speech Acts, University Press, Cambridge, 1969.

[30] Searle, J.R. Indirect speech acts, in: Cole and Morgan (Eds.) Syntax and Semantics,
vol. 3, Academic Press, New York, 1975.

[31] Searle, J.R. A taxonomy of illocutionary acts, in: Gunderson (Ed.), Language, Mind,
and Knowledge, University of Minnesota Press, 1976.

[32] Swartout, W.R. A digitalis therapy advisor with explanations, Proceedings of the Fifth
International Joint Conference on Artificial Intelligence, Cambridge, MA (August
1977).

[33] Szolovits, P., Hawkinson, L., and Martin, W.A. An overview of OWL, a language for
knowledge representation, Proceedings of the Workshop on Natural Language for
Interaction with Data Bases, International Institute for Applied Systems Analysis,
Laxenburg, Austria.

[34] Verschueren, J. Speech act theory: A provisional bibliography with a terminological
guide, (May 1976) Indiana University Linguistics Club, Bloomington, IN.

[35] Verschueren, J. The analysis of speech act verbs: Theoretical preliminaries, (August
1977) Indiana University Linguistics Club, Bloomington, IN.

[36] Zwicky, A.M. and Sadock, J.M. Ambiguity tests and how to fail them, in: Kimball
(Ed.) Syntax and Semantics, vol. 4, Academic Press, New York, 1975.




Appendix A. Core Dialogue Methods 


This appendix contains a list of some of the core dialogue methods associated with 
the speech acts from Section 3. Along with the list of core methods is a description of the 
OBJECT case specification as well as a description of the major standard path steps. The 


actual representation of these methods is done in OWL-I. 


ASK-AND-ANSWER
OBJECT: a what-, where-, who-, whether-, or when-question to be answered
MAJOR STANDARD PATH STEPS:
    P1 ASKs the question.
    P2 evaluates the query to decide whether to comply.
    P2 finds the answer.
    P2 gives the answer.


ASK-AND-DESCRIBE
OBJECT: a thing or how-question⁷²
MAJOR STANDARD PATH STEPS:
    P1 ASKs what the description is.
    P2 decides whether to comply.
    P2 gives the description.


ASK-AND-EXPLAIN
OBJECT: an action, a how-question, or a why-question
MAJOR STANDARD PATH STEPS:
    P1 ASKs what the explanation is.
    P2 decides whether to comply.
    P2 gives the explanation.


72. How-questions are split between ask-and-describe and ask-and-explain depending on the
type of information that seems appropriate. Of course, ask-and-describe and
ask-and-explain can also be triggered by a direct request for a description or explanation,
respectively. The motivation for distinguishing ask-and-describe and ask-and-explain from
ask-and-answer is that the first two will tend to be involved with longer answers that
require more selection and organization of the information. Ask-and-describe and
ask-and-explain are distinguished from each other by the aspects of the topic that are
considered relevant in answering; for ask-and-explain the emphasis is on causal
relationships.




REQUEST-AND-HELP
OBJECT: an action that can be the object of REQUEST
    exception: a repetitive action
MAJOR STANDARD PATH STEPS:
    P1 REQUESTs P2 to take responsibility for helping P1 with an action.
    P2 decides whether to comply.
    in either order:
        P2 ACKNOWLEDGEs the REQUEST.
        If not already understood, roles in the action are assigned
        to the participants.
    then:
        The action is carried out.


SUGGEST-AND-ACCEPT⁷³
OBJECT: an action
MAJOR STANDARD PATH STEPS:
    P1 SUGGESTs the action.
    P2 decides whether to comply.
    P2 ACCEPTs the suggestion.


73. Note that the action suggested is not represented as a step in SUGGEST-AND-ACCEPT.
This is because I see suggestions as directed toward acceptance of an idea, not
directly toward the accomplishment of actions (hence the possible response, "That's a good
idea"). There is, however, a close relation between a suggestion to do an action and the
action actually being carried out. To represent this, we can say that there is a general social
directive that people should behave reasonably: if there is good reason to carry out an
action (and no overriding objection to it) then the action should be carried out. This
information would be incorporated into processes for selecting goals.
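
As a rough illustration of how a core dialogue method and its standard path might be
held as data, here is a minimal Python sketch. It is not OWL-I; the class
DialogueMethod and the helper next_expected_step are hypothetical names, and only two
of the methods listed above are transcribed.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class DialogueMethod:
    """A core dialogue method: a name, an OBJECT case, and its standard path."""
    name: str
    object_spec: str
    steps: tuple  # major standard path steps, in order


ASK_AND_DESCRIBE = DialogueMethod(
    name="ASK-AND-DESCRIBE",
    object_spec="a thing or how-question",
    steps=(
        "P1 ASKs what the description is.",
        "P2 decides whether to comply.",
        "P2 gives the description.",
    ),
)

SUGGEST_AND_ACCEPT = DialogueMethod(
    name="SUGGEST-AND-ACCEPT",
    object_spec="an action",
    steps=(
        "P1 SUGGESTs the action.",
        "P2 decides whether to comply.",
        "P2 ACCEPTs the suggestion.",
    ),
)


def next_expected_step(method, completed):
    """Return the step expected after `completed` steps, or None when done."""
    if completed < len(method.steps):
        return method.steps[completed]
    return None


print(next_expected_step(SUGGEST_AND_ACCEPT, 1))
```

A table of this kind gives a recognizer an expectation for what should come next once a
method has been entered, although the actual methods also carry exception paths that
this sketch leaves out.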




Appendix B. Matching Semantic Representations against Rules 


Although the ISA rules in this paper have been presented informally, I am assuming
a highly restricted relationship between the patterns specified by the ISA rules and the
semantic representations of particular English utterances. This appendix summarizes the
relationship.

First, we must consider the level of representation. The dialogue model I am
assuming assigns more than one semantic representation to an utterance. (These
representations are discussed in Appendix E.) I wish, however, to make my remarks on
matching as independent as possible of the particular choice of representation level for rules
and utterances. For the purpose of explanation, it is assumed that the two representations
are at the same level or close enough so that the correspondences between elements can be
easily identified.

This brings us to the constraints on matching. In saying that a semantic
representation S matches a pattern in an ISA rule, I mean that the elements of S correspond
to the elements of the pattern by any one of the following relationships:


1. An element of S matches an element of the pattern exactly.

2. An element of S has a subclass relationship to the element of the pattern. (I am
assuming that the "class" relationship in "subclass" and "superclass" below is OWL-I
specialization. See Hawkinson [13]. The important point here is that the two elements
are related to each other by an explicit classification link in the knowledge base.)

3. An element of S has a superclass relationship to the element of the pattern.

4. An element of S may match a variable in the ISA pattern either by relationships
(1)-(3) or by being an instance of the variable type. In this paper, variables are
signalled by the use of "some", "a", or "the" in the statement of rules and preconditions.
"The" indicates the case where a variable must be matched by an instance. "A" and "some"
are used where either an instance match may occur or relationships (1)-(3) may be
appealed to.

5. Phrases in rules and speech act preconditions appearing in this paper in
parentheses are not expected to have corresponding elements in utterance
representations.

6. One of the relationships (2), (3), or (4) applies, and added constraints on what may
match the pattern are either given explicitly in the pattern or supplied by the other
parts of the match that have been completed.

The third relationship above is necessary, but it is significantly less common than the first
two. Since parts of the utterance are more general than the pattern, in these cases context
plays a role in determining that the more specific element of the pattern is being referred to.
To illustrate the fourth matching relationship, consider the following examples:

B.1 Something good is bound to come of it if you buy a dollar ticket.
B.2 You'll have a good chance of winning if you buy a dollar ticket.
Examples B.1 and B.2 can be interpreted as REQUESTs derived from Rule 6.2, repeated
here:

P1 can convey a speech act indirectly by
-- a representative with the propositional content that some desirable result or
results can be expected for some intended effect of the speech act.

Under this interpretation, the parts of the examples in italics would correspond to the phrase
"some desirable result or results" in the rule. The italicized clause in B.1 would be related to
the rule by one of the relationships (1)-(4) above, depending on the stand that one took on the
relationship between the good and the desirable. The italicized clause in B.2 would be
considered as conveying an instance of the phrase in the rule, since winning is one possible
desirable result.

In this account of matching, there is still a missing piece: the actual relationships
between particular representations. In keeping with the informality of the rules that I am
presenting, I do not attempt a taxonomy of representations (i.e., hierarchic structures from a
world model).⁷⁴ In addition, I make no attempt to characterize the features of English that
make some realisations of a rule more idiomatic than others. My purpose in discussing
matching is to show that, despite the informality of the presentation, I am expecting the
matching process to be a relatively straightforward, highly constrained operation.


74. Two expressions that come up repeatedly in the statement of the rules, however, are
worth noting. I assume that believe has a superclass relationship to all idea-holding
concepts, e.g., thinking, knowing, assuming, hypothesizing, etc. Want has a superclass
relationship to all goal-holding concepts, e.g., desiring, needing, etc.
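
The first three matching relationships can be sketched in a few lines of Python. This is
only an illustration under simplifying assumptions: representations are flat lists of
concept names, the hierarchy is a hypothetical child-to-parent table seeded with the
believe and want examples from the footnote above, and variables, parenthesized phrases,
and added constraints (relationships 4 to 6) are left out. The names SUPERCLASS,
is_subclass, match_element, and match are not from the paper.

```python
# Hypothetical fragment of a classification hierarchy: concept -> superclass.
SUPERCLASS = {
    "thinking": "believe",
    "knowing": "believe",
    "assuming": "believe",
    "desiring": "want",
    "needing": "want",
}


def is_subclass(concept, ancestor):
    """True if `concept` reaches `ancestor` by following explicit class links."""
    node = SUPERCLASS.get(concept)
    while node is not None:
        if node == ancestor:
            return True
        node = SUPERCLASS.get(node)
    return False


def match_element(s_elem, p_elem):
    """Name the relationship that licenses the match, or return None."""
    if s_elem == p_elem:
        return "exact"        # relationship 1
    if is_subclass(s_elem, p_elem):
        return "subclass"     # relationship 2
    if is_subclass(p_elem, s_elem):
        return "superclass"   # relationship 3
    return None


def match(representation, pattern):
    """Match element by element; return the relationships used, or None."""
    if len(representation) != len(pattern):
        return None
    relations = [match_element(s, p) for s, p in zip(representation, pattern)]
    return None if None in relations else relations


print(match(["P1", "needing", "A"], ["P1", "want", "A"]))
```

Because every relationship is decided by an explicit link in the table, the match either
succeeds by a short chain of lookups or fails outright, which is the sense in which the
operation is expected to be highly constrained.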




Appendix C. ISA Examples for Rules 6.1 to 6.9 


The rules restated:

P1 can convey a speech act indirectly by --

Rule 6.1
(i) ASKing whether the intended speech act is necessary

(ii) ASKing whether an equivalent speech act (i.e., one with the same principal
intended effect) has already been performed

(iii) ASKing whether the principal intended effect can be expected to occur without the
speech act

(iv) ASKing whether the principal intended effect of the speech act has already
occurred

Rule 6.2
-- a representative with the propositional content that some desirable result or results
can be expected for some intended effect of the speech act.

Rule 6.3 (for P1-based preconditions):
-- a representative of the topic of a P1-based precondition of the speech act.

Rule 6.4 (for P2-based preconditions):
-- an ASK of the topic of a P2-based precondition of the speech act.

Rule 6.5 (for unmarked preconditions):
-- an ASK of the topic of an unmarked precondition of the speech act.
This rule applies in a context where P1 believes P2 has better knowledge of the
condition in the precondition topic.

Rule 6.6 (for unmarked preconditions):
-- a representative of the topic of an unmarked precondition of the speech act.
This rule applies in a context where P1 believes P1 has better knowledge of the
condition in the precondition topic.

Rule 6.7 (for unmarked preconditions):
-- a representative of an unmarked precondition of the speech act where P1 believes he
or she has better knowledge of the condition.

Rule 6.8 (for groups of preconditions):
-- a REQUEST form of an action that is a goal of P1 (i.e., A for "P1 wants A"). This
rule is applicable only when the speech act has preconditions that are exact matches or
specializations of the four preconditions of REQUEST.

Rule 6.9
-- an ASK about whether P2 will take responsibility for carrying out an "active" action
that is a goal of P1 (i.e., A for "P1 wants A").
This rule is applicable only when the speech act has preconditions that are exact
matches or specializations of the four preconditions of REQUEST.
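
The way Rules 6.3 through 6.7 are selected by the marking of a precondition can be
summarized in a short Python sketch. It covers only the choice of rule and surface
speech act type as stated in the rules above; the function name applicable_rules and its
argument names are hypothetical, and the lexical gaps noted in the examples that follow
(e.g., Rule 6.5 for obligation preconditions) are not modelled.

```python
def applicable_rules(marking, better_informed=None):
    """Return (rule, surface speech act type) pairs for one precondition.

    marking:         "P1-based", "P2-based", or "unmarked"
    better_informed: for unmarked preconditions, which participant P1
                     believes knows the condition better ("P1" or "P2")
    """
    if marking == "P1-based":
        return [("6.3", "representative")]
    if marking == "P2-based":
        return [("6.4", "ASK")]
    if marking == "unmarked":
        if better_informed == "P2":
            return [("6.5", "ASK")]
        if better_informed == "P1":
            # 6.6 states the topic; 6.7 states the precondition itself.
            return [("6.6", "representative"), ("6.7", "representative")]
        return []
    raise ValueError(f"unknown precondition marking: {marking!r}")


print(applicable_rules("unmarked", better_informed="P2"))
```

For example, the unmarked precondition "P1 believes that P2 can tell the answer" yields
an ASK form ("Can you tell me ...?") when P2 is taken to know better, and representative
forms ("You can tell me ...", "I believe you can tell me ...") otherwise.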




ASK
propositional content: some question
simple form: <interrogative> e.g., Where's the mustard?

principal intended effect: that P2 tell P1 the answer to the question in the propositional
content

Rule 6.1
-i- Do I need to ask you where the mustard is?
-ii- Did I ask you where the mustard is?
-iii- Do you intend to tell me where the mustard is?
-iv- Did you tell me where the mustard is?

Rule 6.2
-- I'll be able to finish these sandwiches if you tell me where the mustard is.

Precondition-based Examples

I. P1 wants to know the answer to the question.
-R6.3- I want to know where the mustard is.

II. P1 believes that P2 can tell the answer to the question.
-R6.5- Can you tell me where the mustard is?
-R6.6- You can tell me where the mustard is.
-R6.7- I believe you can tell me where the mustard is.

III. P1 believes that P2 is willing to tell the answer to the question.
-R6.4- Would you be willing to tell me where the mustard is?
-R6.4- Do you want to tell me where the mustard is?

IV. P1 wants P2 to tell P1 the answer to the question.
-R6.3- I'd like you to tell me where the mustard is.

V. P1 believes that P2 has some obligation (a role obligation, authority obligation, or
general obligation to be helpful) to P1 to tell P1 the answer to the question.
-R6.5- GAP⁷⁵
-R6.6- You ought to tell me where the mustard is.
-R6.6- It's your obligation as a member of this household to tell me where the
mustard is.
-R6.7- I think you should tell me where the mustard is.

II.-V. together:
-R6.8- Tell me where the mustard is.
-R6.9- Will you tell me where the mustard is?


75. Although it is possible to construct contexts in which this form can be used, it seems to
be only marginal. Forms such as "Shouldn't you tell me ...?" and "Don't you think you
should tell me ...?" are far more common. (See Section 6.3.1 for discussion of these latter
forms.) Other forms derived from application of Rule 6.5 to ASK V. do not convey an
ASK of the propositional content at all; for example, "Must you tell me ...?" is a REQUEST
that P2 refrain from telling. Perhaps Rule 6.5 does not apply to ASK V. because P1 is
assumed to know about any obligations that P2 has to P1, so that there is no reason to
question P2.




OFFER
propositional content: some action that P1 believes will be of benefit to P2
simple form: none

principal intended effect: that P2 accept P1's commitment to take responsibility for the action
in the propositional content

Rule 6.1
-i- ?Do I need to offer you a ride to the airport?⁷⁶
-ii- Has anybody offered you a ride to the airport?
-iii- GAP⁷⁷
-iv- Have you accepted a ride to the airport?

Rule 6.2
-- I'd feel a lot better if you'd accept a ride to the airport.

Precondition-based Examples

I. P1 wants to take responsibility for the action.
-R6.3- I want to drive you to the airport.

II. P1 believes that P1 can take responsibility for P1's part of the action.
-R6.5- Can I drive you to the airport?
-R6.6- I can drive you to the airport.
-R6.7- I assume I can drive you to the airport.

III. P1 is willing to take responsibility for P1's part of the action.
-R6.3- I'm more than willing to drive you to the airport.
-R6.3- I'd be glad to drive you to the airport.

IV. P1 wants P2 to perform some action that complements P1's part of the action.
-R6.3- I want you to accept a ride to the airport.


76. This form seems to be marginal due to the conflict between its angry connotations and
the level of politeness involved in an OFFER.

77. This gap is explained by the fact that the principal intended effect for OFFER can be
brought about only by the speech act; there is no independent means of achieving it. Thus,
Clause iii of Rule 6.1 can never hold.


V. P1 believes P2 can perform some action that complements P1's part of the action.
-R6.5- Can you accept a ride to the airport?
-R6.6- You can accept a ride to the airport.
-R6.7- I assume you can accept a ride to the airport.

VI. P1 believes that P2 would be willing to perform some action that complements
P1's part of the action.
-R6.4- Would you be willing to accept a ride to the airport?

VII. P1 believes that P2 has an obligation (to "be helpful") to P1 to perform some action
that complements P1's part of the action.
-R6.5- GAP
-R6.6- You must accept a ride to the airport.
-R6.7- GAP

VIII. P1 believes that P2 has an obligation to P2 (by virtue of P2's own self-interest)
to perform some action that complements P1's part of the action.
-R6.5- GAP⁷⁸
-R6.6- That suitcase is heavy. You should let me drive you to the airport.
-R6.7- That suitcase is heavy. I think you should let me drive you to the airport.

IV.-VII. together:
-R6.8- Please accept a ride to the airport.
-R6.9- Will you accept a ride to the airport?


78. See discussion for ASK V. 




SUGGEST

propositional content: an action
except for: actions in which P1 and P2 share common agency

simple forms:
1. What about <action>? e.g., What about joining the Marines?
2. How about <action>? e.g., How about joining the Marines?

principal intended effect: that P2 consider taking responsibility for the action in the
propositional content.

Rule 6.1
-i- Need I suggest that you join the Marines?
-ii- Has anyone suggested that you join the Marines?
-iii- Are you thinking about joining the Marines?
-iv- Have you considered joining the Marines?

Rule 6.2
-- I'd be pleased if you'd consider joining the Marines.

Precondition-based Examples

I. P1 wants P2 to consider taking responsibility for the action.
-R6.3- I want you to think about joining the Marines.

II. P1 believes that P2 can consider taking responsibility for the action.
-R6.5- Could you think about joining the Marines?
-R6.6- You could think about joining the Marines.
-R6.7- I think you could consider joining the Marines.

III. P1 believes that P2 is willing to consider taking responsibility for the action.
-R6.4- Are you willing to consider joining the Marines?
-R6.4- Do you want to think about joining the Marines?

IV. P1 believes that P2 has an obligation (to "be helpful") to P1 to consider the
action.
-R6.5- GAP⁷⁹
-R6.6- You should think about joining the Marines.


79. See discussion for ASK V. 




-R6.7- I think you should consider joining the Marines. 
V. PI believes that P2 can take responsibility for the action. 


-R65- Can you join the Marines? ears 
* -R66- You can join the Marines. 
-R6.7- 1 assume you can join the a ge 


‘ spe tg 4 


VI. PI believes that PF is willing to ‘ate ieiadbinny tet the’ action. 


-R6.4- Are you willing to join the Marines? 
-R6.4- Would you be willing to join the Marines? 


VII. P1 believes that there are some reasons why the action is desirable.


-R6.5- Would it be good for you to join the Marines?
-R6.6- It would be good for you to join the Marines.
-R6.6- You'd be a credit to your sorority if you joined the Marines.
-R6.7- I believe it would be good for you to join the Marines.


VIII. P1 believes that P2 has an obligation to P2 (by virtue of P2's own self-interest)
to consider taking responsibility for the action.


-R6.5- GAP [80]
-R6.6- You need a new experience. You should join the Marines.
-R6.7- You need a new experience. I think you should join the Marines.

I.-IV. together:


-R6.8- Think about joining the Marines.
-R6.9- Will you consider joining the Marines?


80. See discussion for ASK V. 




The notion of a basis for action defined in Section 7.5.3 can be related to the general 
structure of actions discussed in Section 4. By doing so, we get an additional level of detail 
for the definition of "basis for action" and hence a sounder understanding of Rule 7.6 and
of type 3 implicit-action forms. Moreover, it should be easy to see that the additional level 
of detail has strong relevance for a computational treatment of these forms. I start with a 
look at undesirable states and then go on to look at signals and symptoms. 

Undesirable states that refer to ongoing methods (i.e., actions in progress) correspond
to method failure conditions. For example, D.1, which reports a failure in an ongoing
copying process, may be used to REQUEST that P2 fix the copier.

D.1 The paper jammed.


Other undesirable states (e.g., "It's cold in here.") will match the negation of the
PRINCIPAL-RESULT or other results of some method known to P2. (We expect the
method(s) to be known to P2 due to P1's obligation to avoid ambiguity; see Section 6.1.) In
addition, I suspect that very frequently undesirable states will correspond to an initial
condition [81] in a special planning method. For example, there might be a special planning
method that contained information on "what to do when the room is cold," including
alternative actions and the tradeoffs between them. (For further discussion of these
methods, see Section 12.)
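
As a rough illustration of the matching described above, the following Python sketch finds the methods whose PRINCIPAL-RESULT (or another result) is the negation of a reported undesirable state. The `Method` class, the `negate` helper, the string encoding of states, and the three sample methods are all hypothetical conveniences introduced for this sketch; they are not OWL-I structures.

```python
from dataclasses import dataclass, field


@dataclass
class Method:
    """Hypothetical stand-in for a method known to P2."""
    name: str
    principal_result: str
    other_results: list = field(default_factory=list)


def negate(state):
    """Toy negation over string-encoded states: 'X' <-> 'not X'."""
    return state[4:] if state.startswith("not ") else "not " + state


def methods_addressing(undesirable_state, methods):
    """Names of methods with some result equal to the negated state."""
    wanted = negate(undesirable_state)
    return [m.name for m in methods
            if wanted == m.principal_result or wanted in m.other_results]


# Assumed sample knowledge, for illustration only.
METHODS = [
    Method("close-window", "window closed", ["not room cold"]),
    Method("turn-up-heat", "not room cold"),
    Method("fix-copier", "copier working"),
]
```

Under these assumptions, reporting "room cold" retrieves both `close-window` and `turn-up-heat`, which is the kind of set of alternatives a special planning method would then have to weigh.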


81. This could possibly be represented in a semantic input case called INITIAL-CONDITION,
which would be defined as a specialization of SOURCE, i.e., the point
started from.




This takes care of the relationship of undesirable states to OWL-I methods. The
handling of signals and symptoms differs according to whether or not they are related to
methods already underway in the dialogue encounter. Signals and symptoms related to
ongoing methods will appear in a conditional step. For example, a cake-baking method
that uses an oven timer will have a conditional step corresponding to the following:

When the buzzer of the oven timer goes off, check to see if the cake is done.

That is, given the signal, perform an action. Similarly, whether or not a cake method uses
a signal, the following sort of conditional step might appear:

When the cake starts to smell, check to see if it is done.

Again the same type of pattern: given a symptom, perform an action. In addition to the
simple presence of the states and actions in the method, we might want to label them
explicitly as symptoms or signals. Note that the conditional steps referred to in type 3
implicit-actions may appear not only in the most general ongoing methods but also in
substeps of them (which are themselves, of course, ongoing methods). Thus, not only the
steps in the bake-cake method might be used as a basis for type 3 ISAs but also steps in the
mix-batter or grease-pan methods.
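
The search over an ongoing method and its substeps can be sketched as follows. This is only an illustrative Python fragment: the `ConditionalStep` and `Method` classes, the explicit `kind` label, and the "batter is lumpy" step in mix-batter are assumptions made for the example, not part of the OWL-I formalism.

```python
from dataclasses import dataclass, field


@dataclass
class ConditionalStep:
    condition: str   # state that triggers the step
    action: str      # action to perform when the state holds
    kind: str        # explicit label: "signal" or "symptom"


@dataclass
class Method:
    name: str
    conditional_steps: list = field(default_factory=list)
    substeps: list = field(default_factory=list)  # nested ongoing Methods


def find_bases(method, reported_state):
    """Collect (method name, kind, action) for each conditional step,
    in the method or in any of its substeps, triggered by the state."""
    found = [(method.name, step.kind, step.action)
             for step in method.conditional_steps
             if step.condition == reported_state]
    for sub in method.substeps:
        found.extend(find_bases(sub, reported_state))
    return found


# The mix-batter step is a hypothetical example.
mix_batter = Method("mix-batter", [
    ConditionalStep("batter is lumpy", "stir the batter", "symptom"),
])
bake_cake = Method("bake-cake", [
    ConditionalStep("oven timer buzzes", "check whether the cake is done", "signal"),
    ConditionalStep("cake starts to smell", "check whether the cake is done", "symptom"),
], [mix_batter])
```

Because the search recurses into substeps, an utterance reporting a state from mix-batter is found just as one reporting the oven timer is, which is the point made in the text about substeps serving as a basis for type 3 ISAs.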

For signals and symptoms unrelated to ongoing methods, it appears that one needs an
additional set of representations in the knowledge base. These would contain information
such as that for the following example signal:


The fire alarm is a signal of a fire or a drill.
When it goes off, leave the building immediately.




I.e., <signal> is a signal of <event or process>.
When <signal> happens, <carry out an action>.


A comparable example for symptoms is: 


P1 shivering is a symptom of P1's being cold.


I.e., <symptom> is a symptom of <state>.


Note, first, that symptoms, especially, are not necessarily one-to-one. That is, shivering may also
be a symptom of other states, e.g., fright. Second, I am expecting the fact that it is
undesirable for a person to be cold to be independently determinable from other
information in the knowledge base.
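
A minimal sketch of such independent structures is given below in Python. The dictionary layout, the names `SIGNALS`, `SYMPTOMS`, and `UNDESIRABLE`, and the function `undesirable_states_for` are hypothetical; the separate `UNDESIRABLE` set simply stands in for whatever other knowledge determines that a state is undesirable.

```python
# <signal> is a signal of <event or process>; when it happens, <action>.
SIGNALS = {
    "fire alarm goes off": {
        "signal_of": ["fire", "fire drill"],
        "action": "leave the building immediately",
    },
}

# <symptom> is a symptom of <state>; the mapping is one-to-many.
SYMPTOMS = {
    "shivering": ["being cold", "fright"],
}

# Assumed to be determinable independently of the symptom entries.
UNDESIRABLE = {"being cold", "fright"}


def undesirable_states_for(symptom):
    """States the symptom may indicate that are independently undesirable."""
    return [s for s in SYMPTOMS.get(symptom, []) if s in UNDESIRABLE]


def action_for(signal):
    """Action associated with a signal, or None if the signal is unknown."""
    entry = SIGNALS.get(signal)
    return entry["action"] if entry else None
```

Keeping the undesirability test separate from the symptom table mirrors the expectation in the text that the two kinds of information are stored independently.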




In summary, to relate type 3 implicit-action forms to OWL-I methods, we have called
on a number of different structures. Failure conditions, PRINCIPAL-RESULTs and other
results, and states in conditional steps were used to represent the undesirable states, signals,
and symptoms of Rule 7.6. In addition, independent structures were suggested for some
uses of signals and symptoms.




Appendix E: A Summary of Representation Levels in the General Recognition Model 


The system configuration discussed in Section 9.1 presupposes various types of
semantic representations. In this appendix I collect the different representations and show
briefly how they relate to each other. This list is suggested as the minimal set of
representations for a general recognition process.
Surface Semantic Representation (SSR) 


This representation, output by the parser, closely follows the surface English utterance. Its
salient features are:


-- Noun group references not needed for the parsing process may remain unresolved
(e.g., "I saw him").


-- Systematically ambiguous relationships between constituents may be unresolved if
not needed for parsing. For example, "He was hit by the door." Here, by is
systematically ambiguous between the agent or cause of the action in a passive
construction (i.e., the door made contact with the person) and a location marker (i.e.,
the door is viewed as a place). The fact that the locative is less likely as an
interpretation does not mean that it can be ignored as a possible reading.


-- Choices between word senses need not be made unless, again, they are forced by the 
parsing process. For example, in "I get it" get may be synonymous with understand,
or, if it refers to a journal, get could be synonymous with receive or even subscribe to.
The SSR, however, would contain only a concept corresponding to get unless further 
specialization were necessary for a completed parse. 


-- ISA forms are preserved. That is, "Can you open the window?" would always have 
a SSR that recorded its interrogative nature and that contained a concept 
corresponding to can. Representation of any REQUEST intentions would not occur 
at this level. 


Interpreter Level Representation (ILR)


This is the main level of representation used by the Interpreter; methods are among
the structures represented at this level. ILR has the following characteristics:

-- It is a case-frame oriented representation. The core set of cases are described in
Section 4.

-- Among its constituent concepts ILR may contain variables. The use of these is
discussed for the next representation type.


Evaluated ILR 


This is an ILR with the appropriate other pieces of ILR bound to its variables. 
Evaluation allows general pieces of ILR to be related to current environments. Evaluated
ILR is output by the module Evaluate and is used, among other places, in the structural 
expectations (see Section 9.2). 

For example, if an order-fast-food procedure step were evaluated for three Big
Burgers and a person called Julia, then the concept for Julia would be bound to the
variable for the AGENT of the call, and the concept for the Big Burgers would be bound
to the variable in the OBJECT specification. We thus get a transformation from one
representation to another with roughly the following information content:

R1: AGENT (a person) orders fast food from the CO-AGENT (a person)
R2: Julia orders three Big Burgers from Counter-person-1

The evaluated call representation of R2 can be used to search for an appropriate method to
carry out the procedure.
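
The step from R1 to R2 can be sketched in Python as a simple substitution over a case frame. The dictionary encoding, the `?`-prefixed variable names, and the function name `evaluate` are assumptions made for this sketch; they are not the actual form of ILR or of the Evaluate module.

```python
def evaluate(ilr, bindings):
    """Return a copy of a case frame in which each variable (a string
    starting with '?') is replaced by the concept bound to it.
    Unbound variables are left in place."""
    evaluated = {}
    for case, filler in ilr.items():
        if isinstance(filler, str) and filler.startswith("?"):
            evaluated[case] = bindings.get(filler, filler)
        else:
            evaluated[case] = filler
    return evaluated


# R1: the general (unevaluated) call, with variables in its cases.
R1 = {
    "ACTION": "order-fast-food",
    "AGENT": "?agent",
    "OBJECT": "?food",
    "CO-AGENT": "?co-agent",
}

# R2: the same call evaluated in the current environment.
R2 = evaluate(R1, {
    "?agent": "Julia",
    "?food": "three Big Burgers",
    "?co-agent": "Counter-person-1",
})
```

Note that `evaluate` builds a new frame rather than modifying R1, so the general piece of ILR remains available for evaluation in other environments.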


Transitional Representation 


Transitional representation is the level of representation that appears in special
matching patterns, some of which are discussed in Section 11. [82] This representation is
called "transitional" because it bridges the gap between SSR and ILR. Transitional
representations share case frame structuring and the use of variables with ILR, but they
share sentence-type marking (declarative, interrogative, imperative) and indirect forms with
SSR. Each transitional representation has an associated ILR, and a successful match of a
SSR against the transitional representation (e.g., by the reference matching process)
automatically binds the variables in the ILR.
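
The following Python sketch illustrates how a successful match of an SSR against a transitional pattern could bind the variables of the associated ILR. The flat dictionary frames, the case names (`TYPE`, `MODAL`, `ADDRESSEE`, and so on), and the functions `match` and `recognize` are hypothetical simplifications; in particular, this is not the reference matching process itself.

```python
def match(pattern, ssr):
    """Match a flat pattern frame against an SSR frame.
    Returns a dict of variable bindings, or None if the match fails."""
    bindings = {}
    for key, expected in pattern.items():
        if key not in ssr:
            return None
        actual = ssr[key]
        if isinstance(expected, str) and expected.startswith("?"):
            # A variable: bind it, or check consistency with an earlier binding.
            if bindings.setdefault(expected, actual) != actual:
                return None
        elif expected != actual:
            return None
    return bindings


# A transitional representation: an SSR-like pattern plus its associated ILR.
TRANSITIONAL = {
    "pattern": {"TYPE": "interrogative", "MODAL": "can",
                "AGENT": "?hearer", "ACTION": "?action"},
    "ilr": {"ACT": "REQUEST", "ADDRESSEE": "?hearer", "ACTION": "?action"},
}


def recognize(ssr, trans=TRANSITIONAL):
    """Return the associated ILR with its variables bound, or None."""
    bindings = match(trans["pattern"], ssr)
    if bindings is None:
        return None
    return {case: bindings.get(filler, filler)
            for case, filler in trans["ilr"].items()}
```

For an SSR encoding "Can you open the window?" as an interrogative containing can, `recognize` yields a REQUEST frame with the hearer and the action filled in; a declarative SSR fails the sentence-type test and yields `None`.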


Other Representations 


It is quite possible that some semantic domains will require other sorts of
representations. While the units of the current ILR are not to be construed as lexical items,
ILR is intentionally linguistically oriented. There would no doubt be a need for quite
different representations, e.g., tabular information and mathematical formulas. The
representations summarized in this section, then, are seen as a minimum, not necessarily a
complete set.


82. This representation type was called “subsurface level representation” in earlier versions 
of the model. 


